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DRUG METABOLIZING ENZYMES 



TECHNICAL FIELD 

This invration relates to nucleic add and amino acid sequences of drug metabolizing 
5 ens^ymes and to the use of these sequences in the diagnosis, treatment, and prevention of 

autoimmuneAnflammatory, cell proliferative, neurolo^cal, developmental, endocrine, eye, metabolic, 
and gastrointestinal disorders, including liver disorders, and in the assessment of the effects of 
exog^ious conq)ounds on the exp9cession.of nucleic acid and amino acid sequences of drug 
metabolizing ^ozymes. 

10 

BACKGROUND OF THE INVENTION 

The metabolism of a drug and its movement through the hody (pharmacokinetics) are 
iinportant in detCTminiiig its effects, toxicity* and i^ The three processes 

governing phannacoldnetics are the absoiption of the drug, distribution to various tissues, and 

15 elimination of drug inetabolites. These processes are intimately coupled to drug metabolism, dnce a 
variety of metabolic modifications alter most of the physicochemical and pharmacological {MToperties 
of drugs, including solubility, binding to recq[>tars, and excretion rates. The metabolic pathways 
which modify drugs also accept a variety of naturally occurring substrates such as steroids, fatty 
adds, prostaglandins, leukotrienes, and vitainins. The enzymes in these pathways are therefcne 

20 inqportant sites of biochemicsd and pharmacological int^caction between natural compounds, drugs, 
carcinogens, mutagens, and xenobiotics. 

has long been appreciated tiiat inherited differences in drug inet^ 
diSsrent levels of drug eCBcacy and toxicity among individuals. For drugs with narrow therapeutic 
indices, or drugs which require tHoactivation (such as codeine), these polymorphisms can be criticaL 

25 Moreova:, pronnsing new drugs are fiequmtly eliminated in clinical trials based on toxicities which 
inay only affect a segment of the patient groiq^. Advances in phannacogenoimcs research, of which 
drug metabolimig enzymes constitute an important part, are promising to exparkd the tools and 
information that can be brouglit to bear on questioris of dnig efiEcacy a 
and R.V. ReOing (1999) Science 286:487-491). 

30 Drug metabolic reactions are categorized as I%ase I, which functionalize the drug molecule 

and prepare it for fmrthernietabolism, and Phase II,wUch are conjugal In general. Phase I 
reaction products are partially or fully inactive, and Phase II reaction products are the chief excreted 
species. However, Plme I reaction products are sonoetirnssrncve active than the origin 
administered drugs; this metabolic activation principle is exploited by pro-drugs (eg. L^opa). 

35 Additionally, some nontoxic coiiqx>unds (e.g. aflatoxin, benzo[a]pyrene) are metabolized to toxic 
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intennediates through these pathways. Phase I reactions are usually late-Jixmting in drug metabolism. 
Prior exposure to the conq>ound, or other conq)ounds, can mduce the expression of Phase I ^izymes 
however, and thereby increase substrate flux through the metabolic pathways. (See Klaassm, CD. et 
aL (1996) Casaiett and Doull's Toxicolopv: The Basic Science of Poisons, McGiaw-BBU, New York, 
5 NY, pp. 113-186; Katzung, B.G. (1995) Basic and Clinical Hiannacolopv, Appleton and Lange, 
Norwalk, CT, pp. 48-59; Gibson, G.G. and P. Sfcett (1994) fatroduction to Drug Metabolism, Blaclde 
Academic and Professional, London.) 

I>n]gn]etabolizmg«izymes(DMEs) have broad substrate s^^ Thiscanbe 
contrasted to the inumme system, where a laige and diverse population of antibodies are hig^y 

10 specific for their antig^. TheabiUtyofDMBstometabolizeawide variety of molecules creates the 
potential for drug interactions at the level of metabolism. For exanople, the induction of a DME by 
one conqx)und may affect the metabolism of another conqiound by the enzyme. 

DMEs have been classified according to the type of reaction they catalyze and the cofactors 
involved. The major classes of Phase I enzymes include, but are not limited to, cytochrome P450 and 

15 flavin-containing monooxygenase. Other ^[izyme classes involved in Phase I-type catalytic cycles 
and reactions include, but are not limited to, NADFH cytochrome P450 reductase (CPR), the 
microsomal cytochrome h5/NADH cytochrome b5 reductase system, the fmedoxm/Eerredoxin 
reductase redox pair, aldo/keto reductases, and alcohol dehydrogeoases. The major classes of Phase 
nenzymes include, but are not limited to, UDP glucuronyltransfi^rase, sulfotransferase, glutathione S- 

20 transferase, N-acyltransferase, and N-acetyl transfisrase. 

rytonhmTTift PAS Q and P4S0 c fltalyrir, ry de^associated mzymes 

Members of the cytochrome P450 superfamily of enzymes catalyze the oxidative metabolism 
of a variety of substrates, including natural c(mqx>unds such as steroids, &tty acids, prostaglandins, 
leukotrienes, and vitamins, as well as drugs, carcinogens, mutagens, and xenobiotics. Cytochromes 

25 P450, also known as P450 heme4hiolate proteins, usually act as tpgmmal oxidases in 

multi-component electron transfer diains, called P450-containing monooxygenase systems. Specific 
reactions catalyzed include hydroxylation, oxidation, N-oxidaticxi, solf ooxidation, N-, S-, and Q- 
deallgrlations, desuIEation, deamination, and rediKlion of azo, nitro, and N-^ These 
reactions are involved in steroidogenesis of ^ucocorticoids, Cortisols, estrogens, and androgens in 

30 animals; insecticide resistance in insects; herbicide resistance and flower coloring in plants; and 
enviromnMital bioremediation by microorganisms. Cytochrome P450 actions on drugs, carcinogens, 
mutagens, and xenobiotics can result in detoxification or in conversion of the substance to a more 
toxic product Cytochromes P450 are abundant in the liver, but also occur in other tissues; the 
©o^mf^ are located in microsomes. (See ExPASY ENZYME EC 1.14.14.1; PrositePDOCOOOSl 

35 Cytochrome P450 cysteine heme-iron ligand signature; PRINTS EP450I E-Oass P450 Qtoxsp I 
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signatme; Graham-Loxence, S. and Peterson, J.A. (1996) FASEB J. 10:206-214.) 

F6ur hundied cytochromes P450 have bera identified in diverse organisms including bacteria, 
fungi, plants, and animals (GrahanohLorence, supra) . The B-class is foimd in prokaiyotes and fungi, 
while the E-class is found in bacteria, plants, insects, vertebrates, and mammals. Hve subclasses or 

5 groups are found within die larger family of E-class cytochromes P450 (PRINTS EP4S0I&CIass 
P4S0 Group I signature). 

All cytochromes P4S0 use a heme cofactor and share structural attributes. Most cytochromes 
P450 are 400 to 530 amino acids in length. Thesecondarystrocture of the enzyme is about 70% 
alpha4ielical and about 22% beta-sheet The region an)und the heme4>inding site in the 04enninal 

10 part of the protein is conserved arDangcytochroiDesP4S0. A ten amino acid signature sequence in 
this hem&-iroa ligand region has been identified which includes a conserved cysteine involved in 
binding the heine iron in the fifihcoordiiiation site. In eukaryotic cytochromes P4S0, a 
membrane-spanning region is usually found in the first 15-20 amino acids of the protein, generally 
consisting of proximately 15 hydrophobic residues followed by a positively charged residue. (See 

15 Prosite PDO(X)0081, supnu GrahanohLc»:ence, supra. ) 

Cytochrome P450»izyrnes are involved in ceU proliferation and d Theenzymes 
have roles in chemical mutagenesis and carcinogenesis by metabolizing chemicals to reactive 
mtermediates that form adducts with DNA (Nebert, D.W. and Gonzalez, F.J. (1987) Ann. Rev. 
BiocheuL 56:945-993). These adducts can cause nucleotide changes and DNA rearrangements that 

20 lead to oncogenesis. Cytochrome P450 expres^on in liver and other tissues is induced by xenobiotics 
such as polycyclic aromatic hydrocarbons, peroxisomal proliferators, phenobarbital, and the 
glucocorticoid dexami^hasone (Dogra, S.C et al. (1998) CliiL Exp. Pharmacol. Physiol. 25: 19). A 
cytochrome P450 protein may participate in eye development as mutations in the P450 gene CYPIBI 
cause primary congenital glaucoma (Online Mendelian hihentance in Man (OMIM) *601771 

25 Cytochrome P450, subfemily I (dioxin-inducible), polypeptide 1; CYPIBI). 

Cytochromes P450 are associated with inflammation and infection. Hepatic cytochrome 
P450 activities are profoundly ajBFected by various infections and inflammatory stimuli, some of 
which are suppressed and some induced (Morgan, E.T. (1997) Drug Metab. Rev. 29:1129-1188). 
Effects observed in vivo can be minucked by proinflammatory cytokines and interferons. 

30 Autoantibodies to two cytochrome P450 proteins were found in patients with autoimmune 

polyenodocrinopathy-candidiasis-ectodermal dystrophy (APECED), a polyglandular autoimmune 
syndrome (OMIM *240300 Autoimmune polyenodocrinopathy-candidiasis-ectodermal dystrophy). 

Mutations in cytochromes P450 have been linked to metabolic disorders, including congraiital 
adrenal hyperplasia, the most common adrenal disorder of infancy and childhood; pseudovitamin D- 

35 deficiency rickets; cerebrotendinous xanthomatosis, a lipid storage disease charact^ized by 
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progressive neurologic dysfancdon, premature atherosclerosis, and cataiacts; and an inherited 
resistance to the anticoagulant drugs coumaiin and war&rin (Isselbacher, K.J. et al. (1994) Ifarrison's 
Principles of Internal Medicine. McGraw-Ifill, Lie. New Yoik» NY, pp. 1968-1970; Takeyama, EL et 
al. (1997) Science 277:1827-1830; Kitanaka, S. et al. (1998) N. Engl. J. Med. 338:653-661; OMIM 

5 '>213700CerBbrot^diaousxanthoinatosis; and OMIM #122700 Extremely 
hig^ levels of e?q)ression of the cytochrome P4S0 protdn aromatase were found in a iShrolamellar 
hepatocellular carcinoma from a boy with severe gynecomastia (feminization) (Agarwal, V.R. (1998) 
J. Clin, l^ocrinol. Metab. 83:1797-1800). 

The cytochrome P4S0 catalytic cycle is completed through reduction of cytochrome P4S0 by 

10 NADPH cytochrome P4S0 reductase (CPR). Another microsomal electron transport system 
consisting of cytochrome bS and NADPH cytochrome bS reductase has been widely viewed as a 
minor contributor of electrons to the cytochroim P4S0 catalytic cycle. However, a recent report by 
Lamb, D.C et aL (1999; FBBS Lett 462:283-288) identifies a Candida alMcans cytochrome P4S0 
(CYPSl) which can be eiBBd^y reduced and siqyported by the microsomal cytochrome bS/NADFH 

15 (^tochronoe bS reductase system. Therefore, there are likely many cytochromBS'P4S0 which are • 
supported by this alternative electron donor systenL 

Cytodborome b5 reductase is also responsible for the reducticm of oxidized hemoglobin 
(methemoglobin, or ferrihemoglobin, which is unable to carry oxygra) to the active hraioglobin 
(ferrohemoglobin) in red blood ceUs. Methemoglobiiieima results when there is a high level of 

20 oxidant drugs or an abnormal hemoglolrin (hemoglolmi M) which is not efBdently reduced. 

Methemoglobinemia can also result fiom a hereditary deficiency in red cell cytochrome bS reductase 
(Reviewed m Mansour, A. and Lurie, AA. (1993) Am. J. Hematol. 42:7-12). 

Kfenibers of the <7tocfanxrne P4S0 £mn]y are als^ 
and catabolism. Vitamin D exists as two biologically equival^t prohormones, ^gocalciferol 

25 (vitamin Dj), produced in plant tissues, and cholecaldferdl (vitamin D3), produced in animal tissues. 
The latter f c»m, cholecalcifi^l, is formed upon the exposure of 7-ddiydrocholesterol to near 
ultraviolet light Q.e., 290-3 1 0 nm), normally resulting fiom even Tninimal periods of skin ^qposure to 
sunlig^ (reviewed in Miller, WX. and Portale, AJl (2000) Tr^ids Endocrinol. Metab. 11:315-319). 
Both prohormone forms are further metabolized in the liv^ to 2S-hydroxyvitamin D 

30 (25(OH)D) by the enzyme 25-hydroxylase. 25(OH)D is the most abundant precursor form of vitamin 
D which must be further metabolized in the kidney to the active form, la,25-dihydroxyvitamin D 
(la,25(OH)2D), by the enzyme 25-hydroxyvitamin D la-hydroxylase (la-hydroxylase). Regulation 
of la,25(OH)2D production is primarily at this final step in the synthetic pathway. The activity of 
la-hydroxylase depends upon several physiological factors including the circulating level of the 

35 ^izyme prodnct (la,2S(OH)2D) and the levels of parathyroid hormone (PIH), caldtonin, msulin. 
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calcimn, phosphorus, growth honnone, and'prolactm. Fiuthennore* extrarenal la-hydroxybse 
activity has been rqxirted, suggesting tbiat tissue-specific, local regulation of la,2S(OH)2D 
production may also be biologically inqxstant The catalysis of la^dJEOJ^ to 
24,25-dihydroxyvitanun D (2A^{OH)J>X involving the enzyme 2S-hydroxyvitamm D 
5 244iydroxylase (244iydroxylase), also occurs in the kidney. 244iydroxylase can also use 2S(0H)D as 
a substrate (Shndd, T. et al. (1997) Ptoc. Natl. Acad. ScL U.SA. 94:12920-12925; MiUor, Wi. and 
Portale, A.A. supra: and refi^nces witlun). 

Vitanun D 2S-^ydioxylase, la-hydroxylase, and 244iydroxylase are all NADFH-dep^dent, 
type I (mitochondrial) cytochrome P450 em^mes that show a high degree of homology with other 

10 members of the femily. Vitamin D2S4iydroxylase also shows a bit>ad substrate specificity and may 
also p^onn 26-hydioxylation of bile add intermediates and 25, 26, and 27-hydro3Eylation of 
cholesterol (Dflworth. F.J. et aL (1995) L Biol. Oiem. 270:16766-16774; Mfller, WX. andPortale, 
AA. supra: and references within). 

The active form of vitamin D (la,25(OH)2D) is involved in calcium and phosphate 

15 homeostasis and promotes the differentiation of n^eloid and skin cells. Vitanun Ddefid^cy 

resulting fiom defidendes in the enzymes involved in vitamin D metabolism (e.g., la-hydroxylase) 
causes hypocalcemia, hypophosphatemia, and vitamin D-dependent (sensitive) riclcets, a disease 
characterized by loss of bone doisity and distinctive clinical fisatures, including bandy or bow 
leggedness accompanied by a waddling gaiL Deficioides in vitamin D254iydroxylase cause 

20 cerebrotendinous xanthomatosis, a lipid-storage disease characterized by the dq>osition of cholestml 
and cholestanol in the Achilles* tmdons, brain, lungs, and many other tissues. The disease presents 
with progressive neurologic dysfunction, including posqiubescent cerebellar ataxia, ath^oscleiosis, 
and cataracts. Vitamin D25-hydro9^1asede£udeiK7 does not resuh in rickety 
existence of alternative pathways for die synthesis of 25(OH)D (GrifiBn, JJBw and Zerwddi, J£. 

25 (1983) J. Cain. Invest 72:1190-1199; Gamblin, G.T. et al. (1985) J. Clin. Invest 75:954-960; and 
MiUer, WX. andPortale, AA. supra) , 

Fenedoxin and foiedoxin reductase are dectrm transport acc^^ 
at least one human cytochrome P450 spedes, cytochrome P450c27 racoded by the CYP27 gene 
(Dilworth, FJ. et al. (1996) Biochem. J. 320:267-71). A Streptomvces griseus cytochrome P450, 

30 CYP104D1, was heterologously expressed in E. coli and found to be reduced by the endogenous 
ferredoxin and fetredoxin reductase enzymes (Taylor, M. et al. (1999) Biochem. Biophys. Res. 
Commun. 263:838-42), suggesting that many cytochrome P450 species may be supported by the 
fenedoxin/ferredoxin reductase pair. Foxedoxin reductase has also been found in a model drug 
metabolism system to reduce actinomydn D, an antitumor antibiotic, to a reactive fiee radical spedes 

35 (Flitter, WJD. and Mason, RJP. (1988) Arch. Biochem. Biophys. 267:632-^39). 
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Flavin-mntamifi g monooxvp gnasft fPMO) 

Flavm-contaiimig monooxy^nases oxidize the nucleopUIic nitrog^ sulfiir, and phosphorus 
heteroatomof an exceptional rang^ of substrates. like cytochromes P4S0,FNfOsaie microsomal and 
use NADPH and O2; there is also a great deal of substrate overlap with cytochromes P4S0. The tissue 

S distribution of FMOs includes liver, kidney, and lung. 

There are five differmt known isoforms of FMO in mammals (FMOl, FM02, FMOS, FM04, 
andlMOS), wMchaiee7qiressed]natissu&-specificnianner. Hie isoforms differ in their substrate 
specificities and other properties such as inhibition by various compounds and stereospecificity of 
reaction. FMOs have a 13 amino add signature sequence, the con^onents of wMch span t^ 

10 t^imnal two-thirds of the sequmces and include tiie FAD binding region and the FATGY motif 
which has been found in many N-faydroxylating eazyrass (Stefar, M. et al. (1998) Trends Biochem. 
Sci. 23:56-57; PRINTS FMOXYC9BNASE Flavin-contaming monooxygenase signature). 

Specific reactions include oxidation of nucleophilic tertiary amines to N-oxides, secondary 
amines to hydroxylamines and nitrones, primary amines to hydroxylamines and oximes, and suUur- 

15 containing compounds and phosphines to S- and P-oxides. Hydrazines, iodides, selenides, and boron- 
containing con^unds are also substrates. Although FMOs appear similar to cytochromes P450 in 
their chemistry, they can generally be distinguished from cytochromes P450 in vitro based on, for 
exanq)le, the higher heat lability of FMOs and the ncmionic detergent sensitivity of cytochromes 
P450; however, use of these properties in identification is con^Ucated 1^ fiirther variation among 

20 FMO isoforms with respect to thermal stability and detergent srasitivity . 

FMOs play iiiqxxrtant roles in the metabolism of several dbrugs and xra FMO (FMOS 

in liver) is predominant^ responsible for noetaboliang (S)-nicotine to (S>^cotine N-l'-oxide, which 
is excreted in urine. FMO is also involved in S-oxygenation of cim<^ dine, an Hg-antagpnist widdy 
used for the treatment of gastric ulcers. liver-^aqpiessed forms of FMO are ixotuuler the same 

25 regulatory control as cytochrome P450. In rats, for example, phraiobaxbitaltreatrnent leads to the 
induction of cytochnnne P450, but die repression of FMOl. 

Endogpnoos substrates of FMO include cysteanrnie, which is oxidized to the distilfide, 
cy5tamine,andtiiin^ylamine(114A),i7^hic^ TMA 
smells like rotting fish, and mutations in the FMOS isof orm lead to large amounts of the malodorous 

30 free arziine being excreted in sweat, urine, and breath. These symptoms have led to the designation 
fish-odor syndrome (OMIM 602079 Trimethylaminuria). 
Lysvl oxidase ' 

Lysyl oxidase (lysine 6-oxidase, LO) is a copper-dq)endrat amine oxidase involved in the 
fonnation of comiective tissue matrices by crossfiokiiig collagen and elastin. LO is secreted as an N- 
35 glycosylated piecursOT protem of approximately 50 kDa and cleaved to the mature form of the 
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eiizymeby a metaUoprotease, although ThecoiqiOTatommLOis 
involved in the transport of electrons to and firom oxy gsa to facilitate the oxidative deamination of 
lysine residues in these e)araceUuIar matrix proteh^ While the cooixUnation of copp^ is essential to 
LO activity, insufficient dietary intake of copper does not influence the expression of the q)oenzyiiie. 

5 However, the absence of the functional LO is linked to the skeletal and vascular tissue disorders tiiat 
are associated with dietary cqpper deficiency. LOisalsoinhibitedby a variety of semicarbazides, 
hydrazines, and amino nitrites, as well as hq>arin. Beta-aminopropioiiitrile is a commonly used 
inhibitor. LO activity is increased in response to ozone, cadmium, and elevated levels of hormones 
released in response to local tissue trauma, such as transforming growth £actor-beta, platelet-derived 

10 growtii factor, angiotODsinll, and fibroblast growth £3ctor. Afanoimalities in LO activity have been 
linked to Nfenkes syndrome and ocdpital horn syndrome. Cytosolic forms of the enzyme have been 
iiiq>Iicated in abnormal cell proliferation (reviewed in Rucker, R3. et al. (1998) Am. J. Clin. Nutr. 
67:996S-10Q2S and Smith-Mungo, LI. and Kagan, RM. (1998) Matrix Biol. 16:387-398). 
Dihvdrofolate reductases 

15 Dihydrofolate reductases (DHFR) are utnquitous razymes tiuit catalyze the 

NADPH-dependent reduction of dihydrofolate to tetrahydrofolate, an essential step in the denovo 
synthesis of glycine and purines as well as the conversion of deoxyuridine mcxioiAiosphate (dUMP) to 
deoxythymidinemonoiAospliate(dTMP). The basic reaction is as follows: 

20 7,8^ydrofolate + NADPH 5,6,7,8-tetiahydrofolate + NADP^ 

The enzymss can be inhibited by a number of dihydrofolate analogs, including trimeOrqEKrim and 

m^otrexalB. Since an abundaiK:e of dTNO^ is required for DNAsyntiiesis^ra 

require ttie activity of DHFR. The rq[>ficatianofDNA viruses (i.e.,heEpesviras) also requires liig^ 

25 levels of DHFR activity. As a result, drugs tiiat target DHFR have been used for cancer 

Chemother^ and to inMlMtDNA virus r^licatioQ. (Fdr similar reasons, thymidylate syrithetases are 
also target enzymes.) Dnigstiiat inhibit DHE^ are prefieientially cytotoxic f<^ 
(or DNA vims-infiscted cdis) but have no spedfidty, resulting in the indiscrinmnate destruction of 
dividing cells. Furtiiernuie, cancer cells rnay become resistant to drugs such as rnetto 

30 result of acquired transport defects or the diq»lication of one or more DHFR genes (Stiyer, L. (1988) 
Rinrhftinistrv. W.H Freeman and Co., hic. New York. pp. 511-5619). 
Aldo/keto reductases 

Aldo^keto reductases are monomeric NADPH^lq>endent oxidoreductases with broad 
substrate specificities (Bohren, K.M. et al. (1989) L Biol. CheuL 264:9547-9551). These enzymes 
35 catalyze the reduction of carbonyl-containing cQ^^x^unds, including carbonyl-containiag sugars and 
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aiomatic coiiqx>iiiids» to the coiiespcmdu^ Therefore, a variety of cartjaayl-coiitaii^ 

drugs and xenobiotics aie likely metabolized by enzymes of this class. 

One Imownieaction catalyzed by a ftiidly menibe^ aldose reductase, is the reduction of 
glucose to sorbitol, which is thra further metabolized to fnictose by sorbitol ddiydrogeoase. Under 
5 nocmal conditions, the reduction of glucose to sorbitol is a mmor pathway. In hyperglycemic states, 
however, tiie accumulation of sorbitol is implicated in the development of diabetic complications 
(OMM *103880 Aldo-keto reductase &mily 1, memb^ Bl). Members of this CTzyme family are 
also highly expressed in some Uver cancers (Cao» D. et aL (1998) J. Biol. Oiem. 273:11429-11435). 
Another inqx>rtant mernb^ of the aldo-keto reductase family is NADP^-dqpendent morphine 

10 6-dehydrQgenase (MDH). MDHcatalysestfaeoxidaticmof the 64iydroxylgioiq[>s of morphine and 
codeine to ketones, resulting in the formation of morphincHie and codeinone. Oxidation by MDH 
representsoneof several pathways for the metabolism of the opiates. Other pathways mvolve 
ghicunmidaticxDu CMemethylation and N-deaQ^lation. Active opiate metabolites are produced by 
several of these enzymatic reactions (Cone, EJ. et al. (1978) Drug. Metab. Dispos. 6:488-93 and 

15 Tegeder,Letal. (1999) Gin. Pharmacokinet 37:17-40). Purified MDH has shown promise in 

detecting the presence of opiates in biological samples (Holt, P. J. et al. (1996) AnaL Chem. 68:1877- 
82). 

Alcohol dehvdropenases 

Alcohol dehydrogenases (ADEb) oxidize simple alcohols to the correspcmding aldehydes. 

20 ADH is a cytosoHcen^nie, prefers the cofiactorNAD% and also binds zinc io^ Liver contains the 
highest levels of ADH, with lower levels in kidney, lung, and the gastric mucosa. 

Known ADHisofonns are dinieric proteins composed of 40 ld:>asubunite^ Thoearefive 
known gene loci which encode these subunits (a, b, g, p, c), and some of the loci have characterized 
allelic variants (bj, b^ b^, g]» gz). The subunits can form homodimers and hetraodimers; tiie subunit 

25 coiiqK)sitiondeteinmies the spedfipprop^esoftbe active enzyme. The holoenzymes have 
therefore been categorized as Qass I (subunit compositions aa, ab, ag, bg, gg). Class n (pp), and 
ClassIII(cc). QassIADHisozynaBSCKudizeetfaanolaiidoth^sinaUalipfaatical^^ 
inhibited by pyrazole. Class nisozyn^ prefer longer chain aliphatic and aroinaticalcdio^^ 
unable to oxidize methanol, and are not inhibited by pyrazole. Qass in isozymes prefer even longer 

30 chain aliphatic alcohols (five carbons and long^) and aromatic alccdiols, and are not xohibited by 
pyrazole. 

The short-chain alcohol dehydrogenases include a number of related enzymes with a variety 
of substrate specificities. Included in this group are the mammalian enzymes D-beta-hydroxybutyrate 
ddiydrogooase, (R)-34iydroxybutyrateddiydrogenase, 154iydroxypro5taglandin dehydrogenase, 
35 NADPH-dependent carbonyl reductase, corticosteroid 1 l-beta-ddiydro^oase, and estradiol 17-beta- 
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dehydrogenase, as well as the bacterial enzymes acetoacetyl-CoA leductase, glucose 1- 
dehydrograia^, 3-beta-hydro^steioid dehydiog^iase, 2(H)eta-hydioxystQroid dehydrogenase, liMtol 
dehydrogenase, 3-oxoacyI reductase, 23-dihydro-2,3-dihydroxybeDzoate dehydrogenase, saihitol-6- 
phosphate 2-dehydiogenase, 7-alpha-4iydn>xystooid ddiydrog^nase, cis-l^-dihydroxy-3,4- 
5 cyclohexadiene-l-caiboxylate dehydrogenase, cu^toluene dihydrodiol dehydrogenase, cu'-benzeae 
glycol dehydrogenase, biphenyl-2,3-dihydro-2,3-diol dehydrogenase, N-acylmannosamme l-> 
dehydrogenase, and 2-deoxy-D-glucQnate 3-dehydrogenase (Krozowski, Z. (1994) J. Steroid 
Biochem. Mol. Biol. 51:125-130; Krozowski, Z. (1992) Mol. Cefl Endocrinol. 84:C25-31; and Marks, 
AJL et al. (1992) J. BioL C3ienL 267:15459-15463). 

10 tJDP giucuronvltransferase 

Members of the UDP glucuronyltransferase family (UGTs) catalyze the transfer of a 
glucuronic acid group ficun the cofactor uridine diphosphate-glucuronic add (UDP-glucuronic acid) 
to a substrate. Thetnmsferis'genetaUy toaauclecq)hificheteroatom(0,N,or Substrates include 
xenobic^cs which have been foncdonalized by Phase I reactions, as well as mdogenous conqKyunds 

15 such as bilirubin, stercnd honnones, and thyroid hormones. Pioducts of g^ucuronidation are excreted 
in urine if the molecular weight of the substrate is less than about 250 g^nool, whereas larger 
glucunmidated substrates are excreted in bile. 

UGTs are located in the microsomes of liver, kidney, intestine, skin, brain, spleen, and nasal 
mucosa, where they are on die same side of the endoplasmic reticulum memihrane as cytochrome 

20 P450 engines and flavin-containing monooxygenases, and therefore are ideally located to access 
products of Phase I drug metabolisnt UGTs have a C-tenninal memhrane-spanniiig domain which 
anchors theoi in die euloplasmic reticulum membrane, and a ccHiserved signature domain of about 50 
amino add residues in their C tmmnal section (Prosite FDOO0Q359 UDP-^cosy Itransferase 
signature). 

25 UGTs involved in drug m^abolism aie mcoded by two gpue funilies, UGT 1 and UGT2. 

Members of the UGTl family result from alternative splidng of a single gene locus, which has a 
variable substrate binding domain and coaistant r^on involved in co&ctor IwmKng and membfane 
inserticMi. Members of the UG* Vz family are encoded by separate gessnt lod, and are divided into two 
families, UGT2A and UCrr2B. The 2A subfamily is expressed in olfactoty qrithehm 

30 subfamily is expressed in liv^ndcrosomes. Mutations in UGT genes are associated with 

hyperbilirubinemia (OMIM #143500 Hyperbilirubinemia 1); Crigler-Najjar syndrome, characterized 
by intense hyperbilirubmemia from birth (OMIM #218800 Crigler-Najjar syndrome); and a milder 
form of hyperbilirubinemia turned Gilbert's disease (OMIM *191740 UGTl). 
Sulfotransferase 

35 Sulfide conjugation occurs m many of the same substrates which undergo O-glucuronidation 
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to produce a higUy water-soluble sulfuric acid ester. Sulf otransf^ases (ST) catalyze this leaction by 
transfecring SO 3' from the co&ctor 3*-phosphoadeiiosine-5H>hosphosulfate (PAPS) to the substrate. 
ST substrates are predominantly phenols and aliphatic alcohols, but also mclude aromatic amines and 
aliphatic amines, which are conjugated to produce the corresponding suliamates. The products of 
5 these reactions are excreted mainly in urine. 

STs are found in a wide range of tissues, includmg liver, kidney, intestinal tract, lung, 
platelets, and brain. The enzymes are generally cytosolic, and multiple forms are often co-expressed. 
For exanq>le, there are rnore than a dozen forms of ST in rat liver cytosoL These biochenucally 
characterized STs feJi into five classes based on their substrate preference: arylsulfotransferase, 
10 alcohol sulf otransferase, estrogen sulfotransferase» tyrosine ester sulfotransfisrase, and tnle salt 
sulfotransferase. 

STenzymeactivity varies greatly with sex and age in rats. The combined effects of 
developmental cues and sex-related honnones are thought to lead to these differences in ST 
expression profiles, as well as the profiles of other DMEs such as cytodiromes P450. Notably, the 
15 hi^ expression of STs in cats partially compensates for their low level of UDP glucuronyltransferase 
activity. 

Several forms ofST have been purified from human Uv^cytosol and cloned. Therearetwo 
phraolsuffotraiisferases with different therindstabiUties and substrate p^ The 
thermostable CTzyme catalyzes the sul&tion of {dienols such as paiaHutrophenol, minoxidil, and 

20 acetaminophen; the thermolabile enzyme prefers monoamine substrates such as dopamine, 
epmephrine, and levadqpa. Other cloned STs include an estrpgen sulfotraiisferase and m 
acetylglucosamine-frO-sulfotransCsrase. This last ena^yme is illustrative of Ae other major role of 
STs in cellular biochemistry, the modification of carbohydrate structures tfiat may be inqportant in 
cellular diffi^ntiation and maturation of proteoglycans. lodeed, an inhraited def ect in a 

25 sulf otransferase has been implicated in macular corneal dystrophy, a disord^ charactraized by a 
failure to syndiesize mature kraatan sulfete proteoglycans (Nakazawa, K. ^ aL (1984) J. Biol. CbBssL 
259: 13751-13757; OMIM m7800 Macular ^strophy, cmieal). 
Galactosvltransfierases 

Galactosyhransfoases are a subset of glycosy Itransfierases that transfer galactose (Gal) to the 
30 terminal N-acetylglucosamine (GicNAc) oligosaccharide chains that are part of glycoproteins or 
glycolipids that are free in solution (Kolbmger, R et al. (1998) J. Biol. Chem. 273:433-440; Amado, 
M. et al. (1999) Biochim. Biophys. Acta 1473:35-53). Galactosyltransferases have been detected on 
the cell surface and as soluble extracellular proteins, in addition to being present in the Golgi. pi3- 
galactosyltransferases fonn Type I carbohydrate chains with Gal (pi-3)GlcNAc linkages. Known 
35 human and mouse pl3-galactosyltransfaases appear to have a short cytosolic domain, a single 
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transmemhtane domam, and a catalytic domain with eight conserved regions. (Kolhmg^, supra and 
Hennet, T. et aL (1998) J. Biol. Chrat 273:S8-€5). hi mouse UDP-galactose:P-Nnac^lgIucosamuie 
pl,3-galactosyltrans£B(rase-I region 1 is located at ammo acid residues 78-83, r^on 2 is located at 
amino add residues 93-102, region 3 is located at ammo acid residues 1 16-1 19, region 4 is located at 

5 ammo add residues 147-1S8, region 5 is located at amino add residues 172-183, region 6 is located 
at amino add residues 203-206, region 7 is located at annno add residues 236-246, and region 8 is 
located at amino add residues 264-275. A variant ofasequrace found within mouse UDP- 
galactose:P-N-acetylglucosamine pi,3-galactosyltransferase-I region 8 is also found in bacterial 
galactosy Ittansferases, suggestmg that tMs sequence defines a galactosy Itransferase sequence motif 

10 (Hennet supra) . Recent work suggests that faraiiiiac protein is a pl,3-galactosyltransfen^ 
et al. (1997) CdQ 88:9-11; and Hennet, supra) , 

UDP-GatGlcNAc-l,4-galactosyltraiisferase (-1,4-GalT) (Sato, T. et al.. (1997) EMBO J. 
16:1850-1857) catalyzes the formation of Type II carboli^drate chains with Gal (pl-4)GlcNAc 
linkages. As is the case with the pi,3-galacto$yltransferase, a soluble form of the eozyme is fonned 

15 by cleavage of the membrane-bound fonn. Amino acids conserved among pi,4-galactosyltransfiarases 
include two cystdnes linked through a disulfide-bond and a putative UDP-galactose-binding site in 
the catalytic donnin (Yadav, S. and Brew, K. (1990) J. Biol. Chem. 265:14163-14169; Yadav, SP. 
and Brew, K. (1991) J. Biol. Chem. 266:698-703; and Shaper, NJL. et aL (1997) J. Biol. CbessL 
272:31389-31399). pi,4-gaIactosyltransfe]ases have sev^ specialized roles in addition to 

20 synthesizing carbohydrate chains cm glycoproteins or glycolq)ids. In manmials a pi,4- 

galactosyltransf erase, as part of a heterodimer with a-lactalbumin, functions in lactating mammary 
gland lactose productioiL Api,4-galacto5yItransfBiase(mthesurfaceof spennfoncti 
recqytordiat specifically recognizes die egg. CdDsuifiicepi,4-galactosyltransferasesalsofm^ 
cell adhesion, ceO/basal lamina interactioD, and normal and metastatic cell migration. (Shur, B. 

25 (1993) Cmr. Opin. C3ell BioL 5:854-863; and Shaper, J. (1995) Adv. Bxp. Med, BioL 376:95-104). 
Glntatfaiane S-transfiarase 

The basic reaction catalyzed by glutatluoiie S4ransferases (GST) is the conjug^liGn of an 
electrophile with reduced glutafliione(GS^: GSTs are homndimeric or h^erodiineric protdiK 
localized rnainly in the cytosol, but sorne level of activity is present in rmcros The 

30 major isozymes share common structural and catalytic properties; in humans they have been 

classified into four rmjor classes. Alpha, Mu, Pi, and Theta. The two largest classes. Alpha and Mu, 
are identified by their respective protein isoelectdc points; pi - 7.5-9.0 (Alpha), and pi - 6.6 (Mu). 
Each GST possesses a common binding site for GSH and a variable hydrophobic binding site. The 
hydrophobic binding site in each isozyme is specific for particular electrophilic substrates. Specific 

35 amino add residues within GSTs have been identified as important for these binding sites and for 
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catalytic activity. Residues Q67, T68, DlOl, E104, and R131 are important for the binding of GSH 
0^. R-C. et al. (1995) J. Biol. Oiem. 270:99-109). Readues R13, R20. and R69 arc in^Kartant for 
the catalytic activity of GST (Stenbeig, G. et al. (1991) Biochem. J. 274:549-555). 

Jn most cases, GSTs perfonn the beneficial function of deactivation and detoxification of 
5 potratiaUy nnitagenic and carcinogenic chemicals. However* in some cases their acdon is detrimental 
and rcsults in activation of chemicals with consequent mutageoic and carcinogenic effects. Some 
forms of rat and human GSTs arc rcliable preneqplastic marki^ that aid in the detection of 

CarcinOg^esiS. T7-gp»>cctn«i r\f human HCTg in Kar^ftrial gfraing^ snrh as J^altfirmft Ua tvphimurium 

used in the weU-known Ames test for mutagenicity, has helped to establi^ - 
10 in motag^esis. Dihalomethanes,wMch produce Hvertuniors in mice» arc beUeved to be activate 
GST. This view is supported by the finding that dihalomethanes are more mutagenic in bactmal cells 
expressing human GST than in untransfected cells (Thier, R. et al. ( 1993) Proc. NatL Acad. Sci. USA 
90:8567-8580). The mntagenidty of ethylCTedibromideaad ethylene dicUcHide is increased in 
bacterial cells e?qpressing the human Alpha GST, Al-1, while the mutagenicity of aflatoxin Bl is 
15 substantially reduced by enhancing the expiessira of GST (Simula, TP. et aL (1993) Carcinogenesis 
14:1371-1376). Thus, control of GST activity may be usefid in the control of mutag^neds and 
carcinogenesis. 

GST has be«i iriq>Ucated in the acquired resistance of inany cancers to drug treats 
phenomenon known as multi-drug resistance (MDR). MDR occurs whm a cancer pati^ is treated 

20 with a cytotoxic drug such as cyclophosphamide and subsequently becomes resistant to fliis drag and 
toavarietyof other cytotoxic agents as well. Increased GST levels arc associated with some of these 
drug resistant caiicers, and it is befieved that ttiis increase occurs in response to d^ 
is then deactivated by tfie GST catalyzed GSH conjugation reaction. The increased (SST levels then 
protect the cancer celkfiomottier cytotoxic ag^its which bind to GST. Increasedlevelsof Al-1 in 

25 tuinors has been linked to drug resistance induced by cyclophosphamide 

(1994) Cancer Res. 54: 6215^220). Thnscontrolof GST activity in cancerous tissues may be usefid 
in treatn^g MDR in cancer patients. 
Ga"'Tna..piiitafnYl t ranspeptidase 

Gainma-^utarnyl tranqieptidases arc ubiquitoiisl^ 

30 ^dracellular glutathione (GSH) breakdown by cleaving gamma-glutamyl amide bonds. The 
breakdown of GSH provides cells with a regional cysteine pool for biosynthetic pathways. 
Ckumna-^utamyl transpqptidases also contribute to cellular antioxidant defenses and e^)ression is 
induced by oxidative stress. The cell sur&ce-localized glycoprcteins.arc expressed at hi^ levels in 
cancerceOs. Studies have suggested diat the high level of ganima-^g^ntaniy 

35 present on the sur&ce of cancer cells could be exploited to activate precursor drugs, resulting in hig^ 
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local concentiations of and-cancer ther^utic agmts (Hanigan, MJBL (1998) Cheia Biol. lateiact 
111-112:333^2; TamgucW, N. and Ikeda, Y. (1998) Adv. Bizymol- Relat. Areas MoL Biol. 
72:239-78; Qnkbi, N. et aL (1999) Comp. Biochem. Physiol. B. Biochem. MoL BioL 122:367-380). 
Acyltransfesrase 

5 N-acyltiansf erase enzymes catalyze die transfer of an amino acid conjugate to an activated 
caiboxylic group. Eadograous concqpounds and jcenobiotics aze activated by acyl-CoA synthetases in 
die cytosoU midosomes, and mitochondiia. The acyl-CoA intermediates aie then conjugated with an 
amino acid (^ically glycine, glatamine, or taurine, but also ornithine, argfaiine, histidine, serine, 
aspaitic acid, and sev^ dipeptides) by N-acyltransferases in the c^tosol or rmtochondria to form a 

10 metabolite with an andde bond. This reaction is con^lemraitary to O-ghicuionidatioQ, but amino ad 
conjugation does not produce the reactive and toxic metabolites which often result firom 
glucuronidation. 

One well-characterized enzyme of tiiis class is the bile add-CoA: amino add N- 
acyltnmsferase (BAT) responsible for generating the bile add conjugates which serve as detergents in 
IS ttie gastrointestinal tract (Falany, C.N. et al. (1994) J. Biol. Chem. 269:19375-19379; Johnson, MJL 
et al. (1991) J. BioL CSiem. 266:10227-10233). BAT is also useful as a predictive indicator for 
prognosis of hepatoceDular carcinoma patients after partial hepatectomy (E'urutam, M. et al. (1996) 
Hepatology 24:1441-1445). 

20 Acetyltransferases have been extensively Studied for tildr role in histone acetyl^ EQstone 

acetylation results iix die relaxing of die chromatin structure in eiikary otic cells, allowing transcription 
factors to gain access to promoter elements of die DNA ten^lates in the affected region of the 
genome (or the g^ome in graeral). hi contrast, histane^acetylatian results in a rednction in 
transcripticn by closing die chromatin slrucl u ie and Kmitiiig access of transcripticn factors. Tothis 

25 end, a common means of stimulating cell transcd^ticm is die use of chemical agents that inhibit the 
deac^lation of histories (e.g., sodium butyrate), resulting in a global (albeit artifactual) increase in 
gene esqxression. The modulation of gene e^qiression by ac^lation also results from the acetylation 
of othea-protdns, nichidingbut not limited to, p53, GATA-1, MyoD, ACTR, TFOE, TFDF and die 
high mobility groi^ pn^ins (HMG). hi the case of p53, acetylation results in incTBased DNA 

30 binding, leading to the stimulation of transcription of genes regulated by pS3. The prototypic histone 
acetylase rB[AT> is Gcii5 from Saccharomvces cerevisiae . Gcn5 is a member of a f amily of acetylases 
that includes Tetrahymena p55, human Gcn5, and hvman p300/CBP. Histone acetylation is reviewed 
in (Cheung, WI^ et al. (2000) Cuir. Opin. CeU Biol. 12:326-333 and Berger, SX (1999) Curr. Opin. 
Cell BioL 11:336-341). Some acetyltransferase enzymes possess the alpha/beta hydrolase fold 

35 (Cmter of Applied Molecular &igtneering Inst of Ch^stry and Biochemistry - University of 
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Salzburg, http://piedictsangqr.ac.i]k/iibm-<x>^^ 

classes of enzymss, mcluding but Bot limited to, acetylcholinesteiases and caiboxylesterases 
(Stroctuial Classificatioii of Proteoms, littp://sccqi.iDrc-liiib.caiaac.id[/sc^ 
N-acetvltransfmse 

5 Aromatic amines and hydrazine-containing conqK>unds are subject to N-acetyladon by the N- 

acetyltransfisase enzymes of liver and other tissues. Some xenobiotics can be O-acetylated to some 
extent by the same enzymes. N-acetyltransferases are cytosolic enzymes which utilize the cofactor 
acetylK:oenzyme A (acetyl<:bA) to transfer the acetyl gn>iq> in a two step hi the first step, 

the acetyl groiq> is transfierred from acetyl-CoA to an active site cysteine residue; in the second step, 

10 the acetyl groiq> is transferred to the substrate amino group and the enzyme is regenerated. 
In contrast to most other DME classes, there are a limited number of known N- 
acetyltransferases. la humans; there are two higUy similar enzymes, NATl and NAT2; mice appear 
to have a third foim of the enzyme, NATS. The human forms of N-acetyltransferase have 
indqpendmt regulation (NATl is widely-^Tqnnessed, whereas NAT2 is in liver and gut only) and 

15 ovedapping substrate preferences. Both enzymes appear to accept most substrates to some extent, but 
NATl does prefer some substrates (para-aminobCTzoic acid, para-aminosalicylic add, 
sulfEunethoxazole, and sulfanilamide), while NAT2 prefers others (isoniazid, hydralazine, 
procainamide, dapsone, aminoglutethimide, and sulfamethazine). 
Qinical observations of patients taldoig the antitttbmi^ 

20 thedesciq)tiQnoffa5tandslowacetylatorsofthecQnqKiand. These phsnotypes were shown 

subsequently to be due to mutations in the NAT2 g^e wUdi affected enzyme activity or stability. 
The slow isoniazid acetylator phoiotype is very prevalent in Middle Eastern populations (s^prox. 
70%), and is less prevalent in Caucasian (approx. 50%) and Asian (<25%) populations- More 
recently, functional polymori^iism in NATl lias been detected, with approximately 8% of the 

25 pqpulatitm tested showing a slow acetylator phaiotype (Butcher, N. J. et aL (1998) Fharmacogen^ics 
8:67-72). Since NATl can activate some known aromatic amine cardnogens, polynmpfaism in the 
widdy-esqxTBSsedNATl em^memaybeiiiQxirtantindeteninniii^ 
aoetyhransferase 1). 

AmiiinLran sfacases 

30 Aminotiansfiaases comprise a family of pyridoxal S -phosphate (PLP) -dependent enzymes 

that catalyze transfonnations of amino acids. Aspartate aminotransferase (AspAT) is the most 
extensively studied PLP-containing enzyme, ft catalyzes the reversible transamination of 
dicacboxylic Lramino adds, aspartate and glutamate, and the corresponding 2-oxo acids, oxalacetate 
and 2-oxogIutarate. Otherioembersof the fimdiy include pynivate aminotransferase, branch 

35 amino acid aminotiansfiBrase, tyrosine aminotransferase, aromatic aminotransferase. 
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alaiiine:glyoxylate ammotransferase (AGT), and kynurenine aminotransfetase (Vacca, R.A. et aL 
(1997) J. Biol. ChenL 272:21932-21937), 

Primaiy hyperoxaluria typ&-l is an autosomal recessive disorder resulting in a deficioicy in 
die liv^r-specific peroxisomal enzyme, alaninerglyoxylate annnotransferase-1. The pheno^rpe of the 
S disordra is a deficiency in glyoxylale metabolism, hi the absence of AGT.glyoxylate is oxidized to 
oxalate nitb^dian being tnmsaminated to glycine. Hie result is the deposition of insoluble calcium 
oxalate in the kidneys and urinary tract, ultimately causing lenal failure (Lumb, M J. et al. (1999) J. 
Biol. Chem. 274:20587-20596). 

Kynurraiine aminotransferase catalyzes the irreversible transamination of the L^ryptophan 

10 metaboUtel^kynurenine to foam kynurenic acid. The enzyme may also catalyze the reversible 
transamination reaction between L-2-ammoadipate and 2-oxoglutaiate to produce 2-oxoadipate and 
Lrglutamate. Kynmenic acid is a putative modulator of glutamatergic neurotransmission; thus a 
deficiency in kynurenine aminotransferase may be associated widi pleotrophic eETects (Buchli, R. et 
al, (1995) J. Biol. Chem. 270:29330.29335). 

15 Catechol-O-methvltransferase 

Catechol-O-methyltransferase (COM!) catalyzes the transfer of the methyl group of 5- 
adenosyl-Lnmethionine (AdoMet; SAM) donor to one of the hydroxyl groups of the catechol substrate 
(e.g.» Lrdppa, dopamine, or DBA). Methylation of die 3'4iydroxyI group is favored over melhylation 
of the 4'-hydroxy 1 gmap and the m^nbrane bound isof onn of COMT is more re^ospedfic dian the 

20 solublefocm. Translationof the soluble form of the enzyiiie results fnomutilizaticm of an int 
start codon in a fidl-length mRNA (1.5 Id)) (ur from the translation of a shorter mRNA (1.3 kb), 
transcribed from an internal promoter. The proposed Sj^-like methylatim reaction requires Mg^ and 
is inhibited l^Ca^. The biiiding of the douOT and substrate to CX)MT occurs sequent^ AddM^ 
first bmds COMT in a M^^4ndq>endent manner, followed by the binding of Mg"^ and die bhiding of 

25 the catediol substrate. 

The amount of OOMT in tissues is relatively Ugjb compared to the amount of activity 
normally required, thus inhibition is proUexnatic. Natiethdess,inlubitors have been developed frnrm 
vitro use (e.g., g^llates, tropolone, U-QS21, and 3\4*Hiihydro3^-2Hmeayl-jp!ropioplietr^^ and fisr 
clinical use (e.g., nitrocatedhol-based cosaipounds and tolcapone). Adiiitnistration of diese inhibitors 

30 results in the increased half-life ofL^opa and the coiisequentforrnation of dc^ Inhibition of 
COMT is also likely to increase the half-life of various other catechol-structure compounds, including 
but not limited to ^inephrineAioorqiiiiephrine, isqporenaline, rimiterol, dobutamine, fenoldopam, 
qximoiphme, arid a-inethyldqpa. Adefidency innorqnnqphiinehasbeenliiikedtocliiiical 
depression, hence the use of COMT inhilnto^ could be usefull in the treatment of depressicHL 

35 COMT inhibitors are generally well tolerated with wmiiTnal side effects and are ultimately 
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metabolized in the liver with only minor accumulation of metabolites in the body (Mannist6, P.T. and 
Kaakkob, S. (1999) Phannacol. Rev. 51:593-628). 
Co pper-zinc snp^oxide dismutases 

Copper-zinc superoxide dismutases are conqpact homodimeric metalloenzymes involved in 

5 cellular defenses against oxidative damage- Hie enzymes contain one atom of zinc and one atom of 
copper per subunit and catalyze the disniutatian of sq»ei^ Therateof 
dismutation is difiusion-limited and consequently aihanced by the pcesrace of fav<»:able electrostatic 
inteiactions between the substrate and enzyme active site. Examples of diis class of enzyme have 
been identified in the cytoplasm of all the eukaiyotic. cells as well as in the pmplasm of several 

10 bacteiial species. Copper-zinc superoxide dismutases are robust enzymes that are higl^^ 

proteolytic digestion and denatonngby urea and SDS. In addition to the conq^act structure of the 
enzymes, the presence of the metal ions and intrasubunit disulfide bonds is believed to be responsible 
for enzyme stability. The enzymes undeirgo reversible denaturation at t^aq[)^^ 
OBattistoni. A. et al. (1998) J. Biol. Chem. 273:5655-5661). 

15 Ovexexpression of sup^xidedisnmtase has been in^Ucated in enhancing 

of transg^c alfalfa as well as providing resistance to enviromnraital toxins such as the diphenyl eth^ 
heibicide» adfluorien (McKersie, BJ>. et al. (1993) Plant PhysioL 103:1155-1163). In addtion, yeast 
ceUs become more resistant to freeze-tfaaw damage following exposure to hydrogen peroxide which 
causes the yeast cells to adapt to further peroxide stress by upregulating expression of siq[>eroxide 

20 dismutases. In this study, rnutations to yeast superoxide disnnitaseg^iies had a nu3xede 
efTect on freeze-thaw resistance than mutations which affected the regulation of g^hitalbione 
metabolism, long suspected of being inq;»ortant in detemuning an organism's survival tinough the 
process of ccyopreservation (Jragrbi P^ J.-L ^ aL (1998) J. Biol. Chem. 273:22921-22928). 

Expressicm of superoxide dismutase is also associated with Mvcobacterimn tuberculosis, the 

25 organism that causes tuberculosis. Siyeroxidedismiitase is one oftiie tea rrajorprotrins secreted!^ 
'M. tuberculosis and its ejqnession is upr^gulated approximately S^old in response to oxidative stress. 
M. tuberculosis eaqnnesses almost two orders of w^gP^^^ ^fipigmyidft digmntafift than th^ 
nanpatfaoeemic mvcobocterinm NL sn y pw^tig «nd s»m*^ a math hipher |iroporticm nf tfia expressed 
enzyme. The result is Ae secretion of '^35(>-f old more enzvme by M. tuber culosis than M. smeematis. 

30 providing substantial resistance to oxidative stress (Harth» O. and Horwitz, MA. (1999) J. Biol. 
Ghent 274:4281-4292). 

The reduced expression of copper-zinc sqperoxide dismutases, as well as oth^r enzymes with 
anti-oxidantcq^abilities, has been inqplicated in tiie early sta^ of cancer. The expression of copper- 
zinc superoxide dismutases has heea shown to be lower in prostatic intraepithelial neoplasia and 

35 prostate carcinomas, compared to normal prostate tissue (Bostwick, D.G. (2000) Cancer 89: 123-134). 
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Phosphodiesterases 

Phosphodiesterases make up a class of enzymes which catalyze the hydrolysis of one of the 
two ester bonds in a phosphodiestercooopound. Hiosphodiest^rases are dieie£D]:e crucial to a variety 
of cellular processes. Phosphodiestenises include DNAaiulRNAradonucleases and exonucleases, 
which are essential for cell growth and rq>licatiQn, and topoisom^rases* which break and rejoin 
nucldc acid strands during topological rearrangement of DNA. A Tyr-DNA phosphodiesterase 
fimcdons in DNA repair by hydrolyzing dead-eoid covalent intermediates formed between 
topoisomerase I and DNA ^ouliot, JJ. et al. (1999) Science 286:552-555; Yang, S.-W. (1996) Proc. 
Natl. Acad- Sd. USA 93:11534-11539). 

Acid sphingon^elinase is a phosphodiesterase which hydrolyzes die mendxrane phospholipid 
sphingon^elin to produce ceramide and phosphorylcholine. Phosphorylcholine is used in the 
synthesis of phosphatidylcholine, which is involved in numerous intracellular signaling pathways, 
while coramide is an essential precursor for the generation of gangliosides, membrane lipids found in 
hig^ concentration in neural tissue. Defective acid sphingomyelinase leads to a build-up of 
sphingomyelin molecules in lysosomes, resulting in Niemann-Piclc disease (Schuchman, EJL and 
SJR. Miranda (1997) Genet Test 1:13-19). 

dycorophosphoryl diester phosphodiesterase (also known as gtycerophosphodiester 
phosphodiesterase) is a phosphodiesterase which hydrolyzes deacetylated phospholipid 
glycensf^osphbdiesteis to produce SD-glyce]x>l-3- GlycerDphdsphocholine, 
glycerophosphoethanolamine, glycoqphosphoglycerol, and glycerophosphoinositol are examples of 
substrates for glycerophosphoryl diester phosphodiesterases. A glycerppho^horyl diester 
phosphodiesterase firom E. coK has broad specificity for gtycerophosphodiester substrates (Larson, 
T J. et aL (1983) J. Biol. Chem. 248:5428-5432). 

Cyclic nucleotide j^bioqrihodiesteiases (PDEs) are crucial en^mes in the r^;ulation of the 
cyclic nucleotides cAMP and cGMP. cAMP and cGNfP function as intraceUular second rnessaoigeis 
to transduce a variety of extracellular sigiialsincfaidinghoni^ PDEs 
degrade cyclic nucleotides to tfieir corresponding monophoqd^^ flieieby regulating the 
intracellular conc^itraticHis of cyclic imcleotides and tfieir effects on signal transdoctiaa. Due to their 
roles as regulators of signal transduction, PDEs have hcea extensively studied as chonotherapeutic 
tasesits (Peny, M J. and GJL Higgs (1998) Curr. OpnL Chem. Biol. 2:472-481; Torphy, J.T. (1998) 
Am. J. Resp. Grit Care Med. 157:351-370). 

Families of mammalian PDEs have been classified based on tfadr substrate spedfidty and 
afi5nity, sensitivity to cofactors, and sensitivity to inhibitory agents (Beavo, J A- (1995) PhysioL Rev. 
75:725-748; Conti, M. et aL (1995) Endocrine Rev. 16:370-389). Several of these ferailies contain 
distinct^es,many of which are e^qiressed in different tissues as spl^ WithinPDE 
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fiunilies, there aie muldple isozymes and multiple splice variants of these isozymes (Contt, M. and S.- 
L.C Jin (1999) Prog. Nucleic Acid Res. MoL Biol. 63:1-38). Hie existence of multiple FDB 
&niilies, isozymes, and splice variants is an indication of the variety and complexity of the regulatory 
pathways involving cyclic nucleotides (Houslay, MD. and O. Milligan (1997) Trends Biochem. Sd. 
5 22:217-224), 

Type 1 FDEs(PDEls) are C^^/calmodulin-H]^)endent and appear to be encoded^ 
three different genes, each having at least two different splice variants (KaJdcar, R. et aL ( 1 999) Cell 
Mol. Life Sci. 55:1164-1186). PDBls have been found in the lung, heart, and brain. SomePDEl 
isozymes are regulated in vitro by phosphorylation/d^hosphoiylation. Phosphorylatioa of these 

10 PDEl isozymes decreases the affinity of the enzyme for calmodulin, decreases PDB activity, and 
mcreases steady state levels of cAMP (Elakkar, supraV PDBls noay provide useful therapeutic targets 
for disorders of die central nervous system and the cardiovascular and immune syst^ns, due to the 
involvement of PDEls in both cyclic nucleotide and calcium signaling (Peny, M.J. and G.A. Ifiggs 
(1998) Curr. Qpin. Chem. Biol. 2:472-481). 

IS PDE2s are cGMP-stimulated PDEs that have been found in the cerebellum, neocortex, jbeart, 

kidney, lung, pulmonary artery, and skeletal muscle (Sadhu, K. et al. (1999) J. IBstocheia Cytoch^oiL 
47:895-906). PDB2s are tfaoug|it to mediate the effects of cAMP on catecholamiiie secretion, 
participate in the regulation of aldostoDone (Beavo, supra\ and play a role in ol&ctory signal 
transduction (Juilfe, DM. et al. (1997) Proc. Natl. Acad. ScL USA 94:3388-3395). 

20 PDE3s have higjbi afSnity for bodi cGMP and cAMP, and so these cyclic nucleotides act as 

competitive substrates for PDE3s. PDE3s play roles in stimulating nxyocardial contractility, 
inhibiting platelet aggregation, relaxing vascular and airway smooth muscle, inhibiting proliferation 
of T'lynq)hocytes and cultured vascular smooth muscle cells, and regulating catecholamine^induced 
releaseof free fatty acids from adipose tissue. The PDES family of phosphodiesterases are sensitive 

25 to specific inMbitois such as dlostainide, enoxirnone, and lixaztnone. Isozymesof PDE3 canbe 
regulated by c AMP-d^nd^t protein kinase, or by insulin-dqpendent kinases (Degmnan, B. et al. 
(1997) J. BioL Chem. 272:6823-6826). 

PDE4s are specific for cAMP; are localized to airway smooth muscle, the vascular 
endc^ieUum, and all inflammatory cells; and can be activated by cAMP-dqpendent phosphorylation. 

30 Since elevation of cAMP levels can lead to siq)pression of inflammatory cell activation and to 

relaxation of bronchial smooth muscle, PDE4s have been studied extensively as possible targets for 
novel anti-inflammatory agents, with special enq)hasis placed on die discovery of asthma treatments. 
PDE4 inhibitors are currently undergoing clinical trials as treatnu^ts for asthma, chronic obstructive 
pulmonary disease, and atopic eczema. All four known isozymes of PDE4 are susceptible to the 

35 inhibitor rolipram, a conq)Ound which has been shown to inqirove behavioral memory in mice (Barad, 
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M. et al. (1998) Proc. Nad. Acad. ScL USA 9S:1S0!^1SQZS). PDE4 inhilntars have also been 
studied as possible therapeutic agents ag9iost acute lung injury, endotoxemia, ifaeumatoid arthritis, 
multiple sclerosis, and various neurological and gastrointestinal indications (Doherty, AM. (1999) 
Cunr. Opin. Ch^ BioL 3:466-473). 

5 PDES is Uglily selective for cGMP as a substrate (Torko, I.V. et aL (1998) Biochemistry 

37:42004205), and has two allosteric cGMP-spedfic landing sites (McAllister-Lucas, LM. et al. 
(1995) J. Biol. Ghent 270:30671-30679). Binding of cGMP to fhese allosteric binding sites seems to 
be hnportant for phosphorylation of FDE5 by cGMF-dependent protdn kinase rather than f (»r direct 
regulation of catalytic activity. High levels of PDE5 are found in vascular smooth nonscle, platelets, 

10 lung, and kidney. The inhibitor zaprixiast is effective against PDE5 and PDBls. Modification of 
zaprinast to provide specificity against PDES has resulted in sildenafil (VIACSRA; Pfizer, Luc, New 
Yodc NY), a treatment for male erectile dysfimction (Terrett, N. et aL (1996) Bioorg. Med. Chem. 
Lett 6:1819-1824). hihibitors of PDE5 axe currently being studied as agents for cardiovascular 
thaapy OPerry, M.J. and GA. Higgs (1998) Curr. 0pm. dem. Biol. 2:472-481). 

15 PDE6s, the photoreceptor cyclic nucleotide phosphodiesterases, are crucial conqponents of 

the phototransduction cascade. In association with the G-protein transdudn, PDE6s hydrolyze cGMP 
to xi^;ulate cGMP-gated cation chamiels in photoreceptor meinhranes. la addition to the cGMP- 
bmding active site, n>E6s also have two high-afBnity cGMP-binding sites which are thouglit to play 
a regulatory role in PDE6 function (Artemyev, N.O. et al. (1998) Methods 14:93-104). Defiects m 

20 PDE6s have been associated with retinal disease. Retinal degeneration in the rd mouse (Van, W.et 
al. (1998) Invest OpthalmoL Vis. Sd. 39:2529-2536), autosomal recessive retinitis pigmentosa in 
humans (Dandger, M. et al. (1995) Genomics 30:1-7), and rod/cone dysplasia 1 m Irish Setter dogs 
(Suber,MJL et aL (1993) Proc. NatL Acad Sd. USA 90:3968-3972) have been attributed to 
mutations in the PDE6B gene. 

25 The PDE7 family of PDEs crasists of only one known mranber having mult^le splice 

variants (Blooin,TJ. and JJLBeavo (1996) Proc. NatL Acad. Sd. USA 93:14188-^^^ PDE7s . 
are cAMPqpedfic, but little else is known about tiidrphysido^calfi^ Aldioug^miRNAs 
encoding KDE75 are found in skeietai mosde, heart, bfain, lung* kidney, and pancreas, eqiressioD of 
PDE7 protdns is restricted to specific tissue types (Han, P. et al. (1997) J. BioL Gbem. 272: 16152- 

30 16157; Perry, MJ. and G.A. Ifiggs (1998) Curr. Opin. Oiem. BioL 2:472-481). PDE7s are very 

closely related to the PDE4 family; however, PDE7s are not inhibited by rolipram, a specific inhibitor 
of PDE45 (Beavo, supra) . 

PDE8s are cAMP specific, and are closely related to the n>E4 family. PDE8s are caressed 
in thyroid gland, testis, eye, Uvct, skeletal muscle, heart, kidney, ovary, and brain. The cAMP- 

35 bydrolyzmg activity of PDE8s is not inhibited by the PDB inhibitors rolipram, vinpocetine, milrinone, 
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IBMX (3-kobut^l-l-iiiethylxaiittime), or z^iinast, but PDE8s aie inhibited by dipyridamole (Hsher, 
DJl et al. (1998) Biochem. Biophys. Res. Conmnin. 246:570-577; Hayashi, M. et al. (1998) 
BiochenL Biophys. Res. Conmum. 250:751-756; Sodeiling, S.H. et al. (1998) Proc. NatL Acad. ScL 
USA 95:8991^996). 

5 PDE9s are cGMP specific and most closely lesemble the PDE8 family of PDEs. PDE9s are 

e3q)iessed in kidney, liver, lung, brain, spleen, and small intestine. PDE9s are not inhibited by 
sQdenafil (VIAGRA; PBzer, Inc., New York NY), roliiHtam, vmpocetine, dipyridamole, or IBMX (3- 
isobotyl-l-methylxanthine), but they are sensitive to the PDES inhibits zaprinast (Rsher, DA. et aL 
(1998) J. Biol. Chem. 273:15559-15564; Soderling, S JEL et al. (1998) J. Biol. ChraL 273:15553- 

10 15558). 

PDElOs are dual-substrate PDEs, hydrolyzing both cAMP and cGMP. PDElOs are expressed 
in brain, tihiyroid, and testis. (Soderling, S.H. et aL (1999) Proc. Nafl, Acad. Sci. USA 96:7071-7076; 
Fujishige, K. et al. (1999) J, BioL Chem. 274:18438-18445; Loughney, K. et al (1999) Gene 234:109- 
117). 

15 PDEs are conqx>sed of a catalytic domain of about 270-300 annuo acids, an N4ermmal 

legdlatcxry domain responsible for binding cofactors, and, in some cases, a hydrophilic C-tenninal 
dcMnain of unknown function (Omti, M. and S.-L.C. Jin (1999) Prog. Nucleic Acid Res. Mol. Biol. 
63: 1-38). A conserved, putative zfaac-binding motif has been identified in the catalytic domain of all 
PDEs. N-4enninal regulatory domains include non^atalyticc^^ 

20 PDE5s, and PDE6s; cabnodulin-binding domains in PDEls; and domains containing phosf^orylation 
sites in PDE3s and PDE4s. hi PDE5, the N^emunal cGMP-binding domain spaiis about 380 amino 
acid residues and conopises tandem t&pGats of a conserved sequence motif (McAllister-Lucas, LM. 
etaL (1993) J. Biol. Chem. 268:22863-22873). TheNKJ&iDmotif has been shown by mutagenesis to 
be important for cGMP Unding (TuikD, LV. ^ al. (1996) J. BioL Oiein. 271:22240-22244). FDE 

25 f aimlies display appnndmately 30% amino add identity within the catalytic dc»naui; however, 

isozymes within the same family typically di^lay about 85-95% identity in this legton (e.g. PDE4A 

VSPDE4B). Furtiiermofe, within a femifytfaero is extensive similad^ 

domain; while across families, fliere is little or no seqnence samilarity outside this domain. 

Many of ttie ramtit^TfUt fimrtimiff Af irmm^e mflaTimwitnry TesprmoftR arp. mlnhitPiil liy 

30 agqnts tiiat mcrease mtracellular levels of cAMP (Vergliese, MW. et al. (1995) MoL Pharmacol. 
47:1164-1171). A variety of diseases have been attributed to mcreased PDE activity and associated 
with decreased levels of cyclic nucleotides. For example, a form ofdiabetes insipidus in inice has 
been associated with increased PDE4 activity, an mcrease in low-K^ c AMP PDE activity has beoi 
lepcHled in leukocytes of atopic patients, and PDES has been associated with cardiac disease. 

35 Many inhibitors of PDEs have been identified and have undergone clinical evaluation (Perry, 
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MJ. and GA. Higgs (1998) Curr. Opin, C3iem. Biol. 2:472-481; T<Mphy, TJ. (1998) Am. J. Respir. 
Crit Caie Med. 157:351-370). FDE3 inhibitors axe being developed as antitfaionibotic agoits, 
antihypertensive agents, and as cardiotonic agents useful in tibe treatment of congestive heart failure. 
Rolipram, a FDE4 inhibitor, has been used in the treatment of depression, and other inhibitors of 

5 PDE4 are undergomg evaluation as anti-inflammat(»ryagra)ts. Rolipram has also been shown to 
inhiUt lipopolysacdbaiide (LPS) induced TNF-a which has bem shown to enhance HTV-l replication 
in vitro. Therefore, rolipram may inhilxit HIV-1 replication (Angel, J.B. et al. (1995) AIDS 
9:1137-1144). Additicmally, rcdipram, based on its ability to s(q)press the production of cytokines 
such as TNF-a and P and interfieron y, has been shown to be effective in the treatment of 

10 encephalomyelitis. RoUpramnmy also be effective in treating tardive dysldnesia and was effective in 
treating multiple sclerosis in an e3q)edmental animal model (Sommer, N. et aL (1995) Nat Med. 
1:244-248; Sasaki, H. et aL (1995) Eur. J. Pharmacol. 282:71-76). 

Theopl^lline is a nonspecific FDE inhibitor used in the treatment of bronchial asthma and 
other respiratory diseases. Thecqdiylline is beUeved to act on airway smooth imiscle function and in 

15 an anti-inflammatory or immunomodulatory capacity in die treatment of respiratory diseases (Banner, 
KJL and CP. Page (1995) Eur. Respir. J. 8:996-1000). Pentoxifylline is anodic nonspecific PDE 
inhilntor used in the treatmrnt of intermittrat claudication and diabetes-mduced peripheral vascular 
disease. Pentoxifylline is also known to block TNF-a production and may inhibit HIV-1 replication 
f Angd ei al.. siqiraV 

20 PDEs have been reported to affect cellular proliferation of a variety of cell types (Conti et al. 

(1995) Bftdocrine Rev. 16:370-389) and have be^ in^licated in various cancers. Growth of prostate 
carcinoma cell lines DU145 and LNCaP was infailnted by delivery of cAMP derivatives seA PDE 
mhibitors (Bang, Y J. et al. (1994) Ptoc. NafL Acad. ScL USA 91:5330-5334). These cells also 
showed a permanent conversion in i^benotype from q[»[tbelial to neuronal moarphology. B has also 

25 be^ suggested diat PDE inhibitors have the potential to regulate mesai^gial cell profifeiation 

(Matousovic K. et al. (1995) J. din. Invest 96:401-410) and lyn^ocyte prolifiBiaticn (Joulain, C et 
aL(1995)J.IipidMediatC^ Signal. 11:63-79). A cancer treatment has been described that 
involves intracdfadar ddiveiy cf PDEs to particular ceSular CQn^artniBnts of tomois, resulting in cell 
death (Deonarain, MP. and AA. Qpmetos (1994) Br. J. Cancer 70:786-794). 

30 PhosphotriestCTases 

Phosphotriesterases (PTE, paraoxonases) are enzymes that hydrolyze toxic organophosphorus 
CQriq)ouiids and have been isolated fixnn a variety of tissues. The CT^mes^pear to be lacking in 
birds and insects and abundant in mammals, explaining the reduced tolerance of birds and insects to 
organc^hosphorus conq>ound (Vilanova, E. and Sogcnb, MA. (1999) CriL Rev. ToxicoL 29:21-57). 

35 PhosplK^esterases play a cmtral role in die detoxification of insecticides by mammals. 
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Fhospbotriesteiase activity varies among individuals and is Iowa: in infants than adults. PTE 
knockout mice axe markedly more sensitive to the oiganopfaosphate-based toxins diazoxon and 
chloipyrifos oxot CPurirag, CJEL, et al. (2000) Neuiotoxicology 21:91-100). PTEs have attracted 
interest as aizymes oqpable of the detoxification of organophosphate-containing chemical waste and 
5 warfaie reagents (e.g.,paiatfaion), in addition to pesticides and insecticides. Some studies have also 
in:qplicated phosphotiiesterase in atiierosclerosis and diseases involving lipoprotein n^tabolisnL 
Thioesterases 

Two soluble tiuoesterases involved in fstiy acid biosynthesis have been isolated from 
maimnalian tissues, one wMch is activc <Hily toward long-chain &tty-acyl tfaioesters and one which is 

10 active toward thioesters with a wide range of &tty-acyl chain-lengths. These Moestorases catalyze 
tlip- rKaiTi.ti*mriTiatnig Rtqp tn the /fe nnvn biosynthesis of fat^ adds, CSbain tsmsnation involves tiie 
hydrolysis of the thioester bond which links the fiitty acyl chain to the 4''phosphopante(heine 
prosthetic group of the acyl carrier protein (ACP) subunit of the fytty acid syntiiase (Smith, S. (1981a) 
Methods Enzymol. 71:181-188; Smitii, S. (1981b) Methods Enzymol. 71:188-200). 

15 g. c^ contains two soluble thioesterases, thioeslerase I which is active only toward Icmg- 

chain acyl thioesters, and thioest^rase n (TEH) which lias a broad chain-leiigth specificity (Naggert, J. 
et aL (1991) J. Biol. C3ieia 266:11044-11050). cdi TEH does not exMUt sequence sinnlarity with 
either of the two types of mammalian tfiioesterases which function as chain-l^cxmnatiDg enzymes in 
novo &tty add biosynthesis. Unlike tiiernanniialiantiiioesterases»g.£^TEQ lacks die 

20 characteristic senne active site gly-X-ser-X-gty sequence motif and is not inactivated by the serine 
modifying agesxt diisqpropyl fiuorophosphale. Howev^, nK)dification of histidine 58 by 
iodoacetamideaiuldiethylpyrocatfoonate abolished TED activity. C>vere3q^!ressi(Hiof TEDdidnot 
alter &tty add content in R coli which suggests that it does not function as a chain-teroainatiiig 
eoz3^me in fiittyaddbiosytttfae^(N^ggm^aL»sy|S§)- For that reascHi,NaggQtetal.(su£ra) 

25 proposed that the physiological sid>5trates for E.cofil^ 
instead of ACP-pho^hppanlfa^h£in&-fEitty add esters. 
Carboxylesterases 

Mammafian carboxylesterases constitate a nmltig^ funily expressed in a vari^ of tissues 
and cell ^pes. Isozyines have significant sequence hoinology and are classified prin^ 

30 bads of ammo add sequence. Acetylcholinesterase, bu^rylcholinesterase, and carboxylesterase are 
grouped into die serine sup^amily of esterases (B-esterases). Other carboxylesterases include 
thyrogl6bulin,thrQmtHn, Factor DC, gtiotactin, and (Carboxylesterases catalyze the 

hydrolysis of ester- and amide- groiq^s fiomiiK>lecules and are involved in detoxification of drugs, 
environmental toxins, and carcinogens. Substrates for carboxylesterases include short- and 

35 long-chain acyl-gjycmls, acylcamitine, caxbonates, dipivefiin hydrochloride, cocaine, salicylates. 
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cs^saicin, palmitoyl-coeiizyiis A, imidapril, haloperidol, pynolizidme alkaloids, steroids, 
p-mtrophenyl acetate, malathio n, butanilicaine, and isocatboxazide. The ^izymes often demonstrate 
low substrate specificity. Catboxylesterases are also inqportant foe the conversion of prodrugs to their 
respective free acids, which may be the active form of the drug (e.g., lovastatm, used to lower blood 
5 cholesterol) (reviewed in Satoh, T. and Hosokawa, M. (1998) Anna Rev. Phaimacol. 
ToxicoL38:257-288). 

Neuroligins axe a class of molecules diat (i) have N-terminal signal sequences, (ii) resemble 
cell-surfece receptors, (iii) contain carboxylesterase domains, (iv) are highly expressed in the brain, 
and (v) bmd to neurexins in a calcimn-dependent manner. Despite the homology to 
10 carboxylesterases, neuroligms lack the active site serine residue, inq>lying a role in substrate binding 
rather than catalysis (Ichtohe^o, EL et al. (1996) J. Biol. Chem. 271:2676-2682). 
Squalene epoxidase 

Squalene epoxidase (squalene monooxygenase, SB) is a microsomal membran&bound, FAD- 
dependent oxidoreductase that catalyzes the first oxygenation step in the st^l biosynlfaetic pathway 

15 of eukaryotic cells. Clholestecol is an essential structurd conQ>onent of cytqvlasinic in^ 

acquired via the IJ!>L receptor-mediated pathway or the biosynthetic pathway. Li the latt^ case, all 
27 carbon atoms m the cholesterol molecule are derived from acetyl-CoA (Stryer, L. , supral . * SB 
converts squalene to 23(5)-oxidosqualene, which is then converted to lanosterol and thm cholesterol. 
The steps involved in chdestml biosynthesis are summarized below (Stryer, L. (1988) Biochemistry . 

20 W.H Freeman and Co., Inc. New York. pp. 554-560 and Sakakibara, J. et aL (1995) 270:17-20): 

acetate (fix>m Ace^l-CoA) 3-hydoxy-3-methyl-glutaryl CoA mevalcmate S-phosphomevalonate 

5-pyrophosphomevalonate isopentenyl pyrophosphate dimethylallyl pyrophosphate geranyl 
pyrophosphate -* fiamesylpyro^iosphate squalene -» squalaaeqx>xide lanosterol cholesteroL 
While cholesterol is essmtial for the viability of eukaryotic cells, inordinately hig^ serum 

25 cholesterol levels result in the formation of atherosclerotic plaques in the arteries of higjier 

organisms. This deposition of highly insoluble lipid material onto the walls of essential blood vessels 
(e.g., CCTonary arteries) results in decreased blood flow and potential necroos of the tissues deprived 
of adequate blood flow. HMG^CoA reductase is responsible for the conveisicm of 3-hydroxyl-^ 
methyl-glutaryl CoA (HMG-CoA) to mevalonate, which rqiresoits the first committed step in 

30 cholesterol biosynthesis. HMG-CoA is the target of a number of pharxnaceutical conq)ounds 

designed to Iowct plasma cholesterol levels. However, inhibition of MHG-CoA also results in the 
reduced synthesis of non-sterol intermediates (e.g., mevalonate) required for other biochemical 
pathways. SE catalyzes a rate-lindting reaction that occurs lat^ in the sterol synthesis pathway and 
cholesterol is the only end product of the pathway following the step catalyzed by SK As a result, SB 

35 is the ideal target for the design of anti-4iyperlipidemic drugs that do not cause a reduction in other 
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necessary intennediates (Nakamma, Y. et al. (1996) 271:8053-8056). 
Epoxide hydiolases 

Epoxide hydrolases catalyze the addition of water to epoxide-containing cQnq>oiiiids, thereby 
hydrolyzing epoxides to their conespoading 1^-diols. They are related to bacterial haloalkane 

5 dehalogenases and show sequeiK:e smnlarity to other meoibers of the a/p hydrolase fold family of 
enzymes (e.g., bromoperoxidase A2 fiom Streptomvces aureofeciens, hydroxymuconic semialdehyde 
hydrolases from Psendomonas putida. and haloalkane dehalogenase fcom^mOobac^ 
antotrophicusV Epoxide hydrolases are ubiquitous in nature and have been foniid in mammals, 
mvertebrates,plaiits,fimgi,andbact^ia. Thisfanuly of enzymes is in^ortant for the detoxifi 

10 of xenobiotic epoxide coixqpounds which are often highly electrophilic and destructive when 

mtcoduced mto an oiganisnL Exanq>les of epoxide hydrolase reactions include the hydrolysis of 
cis-9,10-epoxyoctadec-9CZ)-enoic add (leukotoxin) to form its correspoiKfing diol, 
ffareo-9,10-dihydroxyoctadec-12(Z)-enoic add Qeukotoxin diol), and the hydrolysis of 
ds-12,13-epoxyoctadec-9CZ)-enoic add (isoleukcAoxin) to form its corresponding diol 

15 threo-12, 13-dihydroxyoctadec-9(Z)-enoic acid (isoleukotoxin diol). Leukotoxins alter membrane 
penneabiUty and ion tnmspoit and cause inflaimnatoiy responses, hi addition, epoxide carcinogens 
are known to be produced by cytochrome P450 as mtermediates in the detoxification of drugs and 
environmental toxins. 

Hie en^mes possess a catalytic triad consposed of Asp (the nucleophile). Asp (the 

20 histid]iie-suppQrtiiigadd),andI£s(thewat^-activatiiighi5tidine). The reaction medianism of 
epoxide hydrolase proceeds via a covalentiy bound est^ intermediate initiated by the nucleophilic 
attack of one of the Asp residues on the primary carbon atom of the epoxide ring of th^ 
molecule^ leading to a covaleotly bound ester intermediate (Arand, M. et aL ( 1996) J. Biol. Oxm. 
271:4223-4229; Rink, R. et aL (1997) J. BidL CbeaoL 272:14650-14657; Aigiriadi, MA. et aL (2000) 

25 J. BioL Oiem. 275:15265-15270). 

FjizymRR iwnl ved in tyrosine catalysis 

Tlie degradation of the amino add tyrosine, to dtiier succinate and pyruvate or fmmrate and 
ac^oac^ate, requires a large number of enzymes and generates a large number of intermediate 
compounds. In addition, xnanyxenoMoticconqx^undsinay be inetabolized using one or 

30 reactions that are part of the tyrosine catabolic pathway. While the pathway has been studied 

primarily in bacteria, tyrosine degradaticm is known to occur in a variety of organisms and is likely to 
involve many of the same Mological reactions. 

The en29mes involved in the degradation of tyrosine to succinate and pyruvate (e.g. , in 
Arthrobacter spedes) include 4-hydroxyphenylpyruvate oxidase, 44iydroxyphenylacetate 

35 S^ydroxylase, 3,4-dihydroxyphBnylacetate 23-dioxygenase» 5-carboxymethyl-2-hydroxyinnconic 
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ssDoialdehyde dehydrogenase^ trans,cU'5<saiboxyiDe&^^ isomerase, 
homoprotocatechuate isomerase/decaibaxylase, d^-2-oxohept-3-en&-l,7-dioate hydiatase, 
2»4-dihydroxyhq9t--lra7u-2-^^ aldolase, and succinic semialdehyde dehydrogenase. 

The enzymes mvolved m the degradation of tyrosme to fnmaiate and acetoacetate (e.g., m 
5 Pseudomonas species) include 4-hydroxyphraylpynivate dioxygenase, homog^tisate 

l,2-dioxygenase,nialeylacetoacetateisonierase,andfum^ 4jiydroxyphenylacetate 
l^iydroxylase may also be involved if intennediates from the succinateApyruvate pathway are 
accq^ted. 

Additional enzymes associated with tyrosine metabolism in diffeient organisms include 

10 4-chlGroplienylacetat&-3,4-dioxygra[iase» aromatic aminotransf^ase, 

S-oxopent-3-eQ&-l,2^-lricaiboxylate decarboxylase, 2-oxo-hept-3-ene-l,7-dioate hydratase, and 
S-caifooxyniethyl-2-hydroxymucQnate isomerase (Ellis, LJBM. et al. (1999) Nucleic Acids Res. 
27:373-376; Wactett, UP. and Ellis. LJB.M. (1996) J. Microbiol. Meth. 25:91-93; and Schmidt, M. 
(1996) Amer. Soc. ^Gcrobiol. News 62:102). 

15 In humans, acquired or inherited ^netic defects in en^mes of the tyrosine degradation 

pathway may result in hereditary tyrosinenua. One fonn of this disease, heieditary tyrosinemia 1 
(HTl) is caused by a deficiency in the enzyme fumarylacetoacetate hydrolase, the last enzyme in the 
pathway in nrganisms that mf tab^lizff tyn>anp' fiT™ tr »^ft flrftfr n«« *tat ft . HTl is charactmzed by 
progressive liver damage begumiiig at infancy, and increased risk for liver cancer (Endo, F. et al. 

20 (1997) J. BioL Chem. 272:24426-24432). 

The discovery of new drug metabolizing enzymes, and tiie polynucleotides racoding them, 
satisfies a need in the art by providing new compositicHis which are useful in the diagnosis, 
preveaticMi, and treatment of autohrmrnnf/inflammatory, ceD proliferative, neurological, 
developmental, osdocrine, eye, mi^abolic and gastrointestinal disorders, including Hver disoiders^ 

25 and in die assessment of the e%cts of exogenous compounds on the exprossioii of nucleic add and 
amino add sequ^K:es of drug metabolizing enzymes. 

SUMMARY OF THE INVENTION 

The invention features purified polypeptides, drug metaboliang oizymes, referred to 
30 coBectively as "DMF* and mdividually as •T)ME^1," *T)MB.2," •DMB.3," •T)M&4," •T>M&.5," 
'VME^r *T)M&7," *T)MEr8," •T>ME-9," 'T>ME-10." *T>ME-11." T)ME-12," and 'TMErlSr In 
one aspect, the invention provides an isolated polypeptide selected from the groqp consisting of a) a 
polypq>tide conqtrising an ammo add sequsoce selected from the groq> consisting of SEQID N0:1- 
13, b) a polypeptide coiuprisnig a naturally occurring amino add sequence at least 90% identical to 
35 an amino add sequence selected from the group consisting of SEQ ID NO: 1-13, c) a biologically 
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active fragment of a polypeptide having an amino acid sequence selected fiom the groiq> consisthig of 
SEQ ID NO: 1-13, and d) an immunogenic fiagn^ of a polypeptide having an amino acid sequence 
selected from die group consisting of SEQ ID NO: 1-13. In one alternative* titeinvraticm provides an 
isolated polypeptide conqxrismg the ammo acid sequence of S3BQ ID NO: 1-13. 
5 The invention further provides an isolated polynucleotide encoding a polypeptide selected 

from the group consisting of a) a polypeptide conqvising an amino acid sequence selected from the 
group consistnig of SEQ ID NO: 1-13, b) a polypeptide conqprising a naturally occwring amino acid 
sequence at least 90% identical to an amino add sequence selected from the group consisting of SEQ 
ID NO: 1-13, c) a biologically active fragment of a polypeptide having an amino acid sequence 
10 selected from the groiq) consisting of SEQ ID N0:l-13, and d) an immunog^c fragment of a 

polypeptide havii^ an amino add sequence selected from the group consisting of SEQ ID NO: 1-13. 
In one altCTiative, the polynucleotide encodes a polypeptide selected from the group consisting of 
SEQ ID NO: 1-13. In another alternative, tiie polynucleotide is selected from the group consisting of 
SEQIDNO:14-26. 

15 Additionally, the invention provides a recombinant polynucleotide conqnri^ng a promoter 

sequence opmbly linked to a polynucleotide encoding a polypeptide selected from the group 
consisting of a) a polypeptide comprising an amino add sequence selected from the group consisting 
of SEQ n> NO:l-13, b) a polypeptide coaqprisiAg a naturally occurring amino add sequence at least 
90% identical to an amino add sequrace sdected fromtiie group consisting of SEQ ID NO: 1-13, c) a 

20 biologically active fragmmt of a polypeptide having an amino add sequence selected from the group 
consisting of SEQ JD NO: 1-13, and d) an immunog^c fragment of a polypeptide having an amino 
acid sequence selected from the groiq> consisting of SEQ ID NO: 1-13. In one alternative, the 
imrention provides a ceUtransfonned with the recondnn^ In another alternative, the 

invention provides a tcans^odc organism compiising ttie recombinant polynucleotide. 

25 The invention also provides a method for producing a polypeptide selected from the groqp 

consisting of a) a polypq^de cQoqnising an amino add sequ^^ 

of SEQ ID NO: 1-13, b) a polypeptide comprising a naturally occoning amino add sequence at least 
90% idcnticdl to an amino add sequence selected from the group consisting of SEQ ID NO: 1-13, c) a 
biolp^caDy active fragment of a polypeptide having an amino add sequence selected from the group 

30 consistuig of SEQ ID NO: 1-13, and d) an immunog^c fragment of a polypeptide having an amino 
add sequence selected from the grotq) consisting of SEQ ID NO:l-13. The method comprises a) 
cultuiing a cell under conditions suitable for expression of the polypeptide, whmin said ceQ is 
transformed with a recombinant polynucleotide conoprising a promoter sequence operably linksd to a 
polynucleotide encoding the polypeptide, and b) recovoing the polypeptide so e^qxressed. 

35 Additionally, the invention provides an isolated antibody which q)ecifically binds to a 

26 
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polypeptide selected from the group consistmg of a) a polypeptide comprising an amino add 
sequence selected from the group consisting of SEQ ID NO: 1-13, b) a polypeptide confirising a 
naturally occurring amino acid sequence at least 90% idraitical to an amino add sequence selected 
from the group consisting of SEQ ID NO: 1-13, c) a biologically active fragment of a polypeptide 

5 having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-13, and d) an 
immunc^enic fragment of a polypef^de having an amino acid sequence selected fit>m the group 
consistmg of SEQ ID NO: 1-13. 

The invention further provides an isolated polynucleotide selected from the groi^ consisting 
of a) a polynucleotide comprising a polynucleotide sequence selected from the groiq> consisting of 

10 SEQ ID NO: 14-26, b) a polynucleotide comprising a naturally occurring polynucleotide sequence at 
least 90% identical to a polynucleotide sequence selected from the gjmfop consistmg of SEQ ID 
NO: 14-26, c) a polynucleotide conoplemmtary to the polynucleotide of a), d) a polynucleotide 
CQiqplementary to die polynucleotide of b), and e) an RNA equivalent of a)-d). hi one alternative, the 
polynucleotide comprises at least 60 contiguous nucleotides. 

15 Additionally, the mvraition provides a method for detecting a target polynucleotide in a 

sample, said target polynucleotide having a sequence of a polynucleotide selected from the group 
consisting of a) a polynucleotide con^sing a polynucleotide sequence selected from the group 
consisting of SEQ ID NO: 14-26, b) a polynucleotide comprising a naturally occurring polynucleotide 
sequence at least 90% identical to a polynucleotide sequrace selected frcm the group oxisisting of 

20 SEQ ID NO: 14-26, c) a polynucleotide compl^nentary to the polynucleotide of a), d) a 

polynucleotide conq>lementazy to the polynucleotide of b), and e) an RNA equivalent of a)-d). The 
method comprises a) hybridizbog the sanq>le with a probe conqxrismg at least 20 contiguous 
nucleotides conqxrising a sequence con^lementary to said target polynucleotide in the sample, and 
which probe specifically hybridizes to said target polynucleotide, under ccxiditions wheieby a 

25 hybridization coiiq>lex is fanned between said probe and said targ^ 

thereof, and b) detecting the presmce or absoice of said hybridization canq>lex, and optionally, if 
present, the amount thereof hi cmealteixiative, the probe conpdses at least 60 contiguous 
nucleotides. 

The invmtion frnther provides a metiiod for detecting a target polynucleotide in a sanople, 
30 said target polynucleotide having a sequence of a polynucleotide selected from the group consisting 
of a) a polynucleotide conq^rising a polynucleotide sequence selected from the group consistuig of 
SEQ ID NO: 14-26, b) a polynucleotide conqnising a naturally occurring polynucleotide sequence at 
least 90% identical to a polynucleotide sequence selected from die group consisting of SEQ ID 
NO: 14-26, c) a polynucleotide conq>lementary to the polynuclec^de of a), d) a polynucleotide 
35 conq)lementary to the polynucleotide of b), and e) an RNA equival^t of a)-d). The method 
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cQnqxdses a) aiiq)lifyiiig said taiget po^ucleotide or fragment theareof using polymerase chain 
reaction amplification, and b) delecting flie presence or absence of said amplified target 
polynucleotide or ficagment tb^peof , and, optionaOy, if present, the amount thereof. 

The invmtion fiirtiier provides a composition coEqxiising an effective amount of a 

5 polypeptide selected from the group consisting of a) a polypeptide comprising an anm 

sequence selected fiom die group consisting of SEQ ID NO: 1-13, b) a polypeptide comprising a 
naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-13, c) a biologically active fragment of a polypq>tide 
having an anuno add sequooce selected from the group consistmg of SEQ ID NO: 1-13, and d) an 

1 0 unmunogenic fragment of a polypeptide having an amino add sequence selected from the group 

consisting of SEQ ID NO: 1-13, and a phannaceutically acceptable excipirat. fia one enibodunirait, the 
composition conqnrises an amino add sequence selected frx>m the groiq> consistmg of SEQ ID NO:l- 
13. The invention additionaUy provides a metiiod of treating a disease or condition assoda 
decreased e^tpression of functional DMB, comprising administBiing to a patient in need of such 

15 treatment the conqK>sition. 

The invention also provides a method for screening a conqxnmd for effectiveness as an 
agonist of a polypeptide selected from the group ccmsisting of a) a polyp^tide conqnrising an amino 
add sequence selected fixim the group consisting of SEQ ID NO: 1-13, b) a polyp^tide conqprising a 
naturally occurring amino add sequence at least 90% identical to an amino add sequence selected 

20 from the group consisting of SEQ ID NO: 1-13, c) a biologically active fragment of a polypeptide 
having an amino add sequence selected from the group consisting of SEQ ID NO: 1-13, and d) an 
immunog^c fragmoit of a polypeptide having an amino add sequence selected firom the group 
consisting of SEQ ID NO: 1-13. The mi^iod comprises a) esqwdng a sample comprising ttie 
polypeptide to a conqx)und, and b)d^ectnaigag(»iist hi one alternative, the 

25 invention provides a conqx)sitioii ccnnprising an ag^mist compound identified by the nK^faod and a 
pharxnaceutically accq[>table excipient hi anoth^ alternative, the inventicm provides a method of 
treating a disease or condition associated with decreased expiession of functional DMB, comprising 
admmistering to a patient in need of such treatment the conqposition. 

Additionally, the invention provides a m^od for screening a compound fm effectiveness as 

30 an antagonist of a polypeptide selected from the group consisting of a) a polypeptide conqprising an 
amino add sequence selected from the groiq> consisting of SEQ ID NO: 1-13, b) a polypeptide 
comprising a naturally occurring airrino add seq[uence at least 90% identical to an amino add 
sequence selected from the group consisting of SEQ ID NO: 1-13, c) a biologically active fragment of 
a polypeptide having an amino add sequence selected fix>m the groiq> consisting of SEQ ID NO:M3, 

35 and d) an immunogenic fragment of a polypeptide having an amino add sequence selected fix)mtixe 
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groop consisting ofSEQ ID NO: 1-13. Tbe mediod coniprises a) e^qK>sing a saixq)le compnsi^ 
polypeptide to a compound, and b) detecting antagonist activity in the sample, la one alternative, tiie 
invention provides a composition comprising an antagonist compound identified by die method and a 
phatmaceutically acceptable exdpient In another alternative* die invention provides a method of 
' 5 treatmg a disease or condition associated with overexpression of fimctioiialD]^^ 
administering to a patient in need of such treatment the composition. 

The inv^tion further provides a method of screening for a compound that specifically binds 
to a polypeptide selected firom tlie group consisting of a) a polypeptide comprising an amino acid 
sequence selected from the gcaap consisting of SEQ ID NO: 1*13» b) a polypeptide conqirising a 

10 naturally occurring amino acid sequence at least 90% idmtical to an amino acid sequence selected 
firom the group consisting of SEQ ID NO: 1-13, c) a biologically active fragment of a polypeptide 
having an amino add sequence selected finom the group consisting of SEQ ID NO: 1-13, and d) an 
immunogenic fi:agment of a polypeptide having an amino add sequence selected firom the groiq> 
consisting of SEQ ID NO: 1-13. The method con^iises a) combhung the polypeptide with at least 

15 one test compound under suitable ccmditions, and b) detecting binding of the polypeptide to the test 
conqpound, thmby identifying a compound that specifically bmds to tiie polypeptide. 

Tbe invration fiirther provides a method of scre^iing for a compound that modulates the 
activity of a polypeptide selected from the groiqp consistmg of a) a polypeptide conqnrising an ammo 
add sequence selected fix>mtiie groiq> consisting of SEQ ID N0:l-13, b) a polyp^tide comprising a 

20 naturally occurring amino acid sequmce at least 90% identical to an amino acid sequence selected 
from the groi^ consisting of SEQ ID NO: 1-13, c) a biologically active fi:agment of a polypeptide 
having an amino add sequence selected firom the groiq> consisting of SEQ ID NO: 1-13, and d) an 
immunogenic fisgmmt of a pofypeptide having an amino add sequence selected fitnn the group 
consisting of SEQ ID NO: 1-13. The m^bod comprises a) conilrini]]^ the poly^ 

25 one test compound under conditions pemdssive for the ar^vity of the polypeptide^ b) assessing tiie 
activity of tiie polypeptide in die presence of the test cQo^ourid, and c) <^ 

poIyp^)tide in tbe pres^ce of tfie test compound with tte activity of flie poIyp^)tide in the abs^ice 
of die test coiiqpound, wfaevein a dtaiige in the activity of die polyp^tide 
conqpound is indicative of a conqKHmd tiiat modulates the activity of die polypeptide. 

30 The invention further provides a method fisr screening a compound for efTectiveness in 

altering e;qitession of a target polynucleotide, wh^in said target polynucleotide conoqprises a 
polynucleotide sequrace selected firom the group consistmg of SEQ ID NO: 14-26, the metiiod 
cnm prising a) exposing a saiuple comprising the target polynucleotide to a CQnq)ound, b) detecting 
altered e>qxressionofthe target polynudeotide, and c) comparing the expression of the target 

35 polynucleotide in tfie presence of varying amounts of the compound and in the absence of the 
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c(Hxq>ound. 

The mvmtioa fiirdi^ provides a method for assessmg toxicity of a test compound, said 
method conqxrising a) treating a biological sample containing nucleic acids with the test con^und; 
b) hybridizmg die nucleic adds of the treated biological sample with aprobe conq>iisnig at least20 

5 contiguous nucleotides of a polynucleotide selected from the group consisting of i) a polynucleotide 
conpising a polynucleotide sequence selected from tibe group consisting of SEQ ID NO: 14-26, ii) a 
polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% identical to a 
polynucleotide sequence selected from the group consisting of SEQ ID NO: 14*26» iii) a 
polynucleotide having a sequence conzplen^ntary to i), iv) a potynucleotide conq^lementaiy to the 

10 polynucleotide of ii), and v) an RNA equivalent of i>iv). Hybadization occurs under conditions 
whereby a specific hybridization conq)lex is formed between said pi^^ 
in the biological sanq>le, said target polynucleotide selected firom the group consisting of i) a 
polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ ID 
NO: 14-26, ii) a polynucleotide conqxrising a naturally occurring polynucleotide sequence at least 

15 90% identical to a polynucleotide sequence selected ftom the group consisting of SEQ ID NO: 14-26, 
iii) a polynucleotide complementary to the polynucleotide of i), iv) a polynucleotide conq>lematitary 
to the polynucleotide of ii), and v) an RNA equivalent of i>iv). Alternatively, the target 
polynucleotide comprises a fragmimt of a polynucleotide sequence selected frcmi the group consisting 
of i>v) above; c) quantifying tiie amount of hybridization con^lm; and d) comparing the amount of 

20 hybridization conq>lex in die treated biological sanple with the amount of hybridization conplex in 
an untreated biological sample, wherein a difference in the amount of hybridi^ticm compl&a in the 
treated biolo^cal saiiq;)le is indicative of toxicity of the test conqKMmd. 

BRIEF DESCRiraON OF THE TABLES 

25 Table 1 gniifmiari»>g tftft nmrnmrlatwrn for thft fall lengtfi polynnclflnride and polypeptide 

sequences of the preset inventioiL 

homolog for polypqptides of Ae invention. The probability scores for the matches b^wera eadi 

polypeptide and its homolog(s) are also shown. 
30 Table 3 shows structural features of polypeptide sequences of the invention, including 

predicted motife and domains, along with the metiiods, algorithms, and searchable databases used for 

analysis of the polypeptides. 

Table 4 lists the cDHA and/or g^onnc DNA fragments which were used to assemble 

polynucleotide sequraces of tiie invention, along with selected fragments of the polynucleotide 
35 sequences. 
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Table 5 shows the repiesentative cDNA libaaiy for polynucleotides of the inventton. 

Table 6 provides an appendix which describes the tissues and vectors used for constmcdon of 
the cDNA libraries shown in Table 5. 

Table 7 shows the tools, programs, and algorithms used to analyze the polynucleotides and 
5 polypeptides of the invention, along with applicable descriptions, leferraces, and thieshold 
parameters. 

DESCRIPTION OF THE INVENTION 
Before the present protems, nucleotide sequences, and methods aie described, it is understood 
10 that this invmtion is not limited to the particular machines, mat^als and methods described, as these 
mayvary. It is also to be understood that the tennmolcygy used herein is for the purpose of d^ 
particular embodiments only, and is not intended to limit the scope of tibe present invention which 
will be limited only by the appended claims. 

It must be noted that as used herein and in the appended claims, the singular forms 'V "^an," 
15 and "the'* include plural reference unless die context clearly dictates otherwise. Thus, for example, a 
reference to "a host cell" includes a plurality of such host ceUs, and a reference to "an antibody" is a 
reference to one or more antibodies and equivalrats thereof known to those skilled in the art, and so 
fordL 

Unless defined otherwise, all technical and scientific terms used herein have the same 
20 meanings as commonly undostood by <xie of ordinary skill in the art to which this invention belongs. 
Although any machines, materials, and methods similar or equivalent to those described herein can be 
used to practice or test the present invention, the preferred tnnc^timftg^ materials and methods are now 
described. AU pubficaticHis ineiitioned heidn are cited for the puqpose 

the cell lines, protocols, reagents and vectors whidi are rqxnted in tfie publications and which might 
25 be used in connecticHi with the invention. Nothing herein is to be construed as an admission that the 
inventicm is not ^titled to antedate such disclosure by virtue of prior invention. 
DEFINITIONS 

"DME" refers to the amino acid sequ»x:es of substantially purified DME obtained fitnn any 
species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and 
30 human, and from any source, whedier natural, synthetic, semi-^synthetic, or recombinant 

The term "ag<HU5t" refers to a molecule which int^isifies or miinics the biological activity of 
DME. Agonists may include proteins, nucleic adds, carbohydrates, small molecules, or any other 
c(Hiqx)und or composition which modulates die activity of DME either by directiy interacting with 
DME or by acting on conqxments of die biological padiway in which DME partic^>ates. 
35 An^allelicvaiiantf' is an alternative form of the gene encoding DME. Allelic variants may 
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tesult from at least one mutation in the nucleic acid sequ^ce and may result in altered mRNAs or in 
polypeptides whose stnictme or function may or may not be altered.* A g^e may have none, one, or 
many aUeUc variants of its naturaUy occurring fonn. ConmKniniutaticnial changes which ^ve rise to 
allelic variants are graierally ascribed to natural deletions, additions, or substitutions of nucleotides. 
5 Each of these types of changes may occur alone, or in combination with the others, one or more times 
in a ffvea sequence. 

'^Altered" nucleic add sequences encoding DMB include those sequences with deletions, 
insertions, or substitutions of differmt nucleotides, resulting in a polypeptide the same as DME or a 
polypeptide with at least one functional characteristic of DME. Included within this definition are 

10 polymorphisms which may or may not be readily detectable using a particular oligonucleotide probe 
of the polynucleotide ^coding DME, and iminx>per or unexpected hybridization to allelic variants, 
with a locus odi^ than the normal chromosomal locus for the polynucleotide sequence encoding 
DME. The encoded protein may also be ^'altered," and may contain deletions, insertions, or 
substitutions of amino acid residues which produce a silent change and result in a functionally 

15 equivalCTtDMK DeUberateaminoacidsubstitutionsmaybeinadeonthebasisof sinnlarity in 
polarity, charge, solubility, hydrophobicity, hydrq^ihilicity, and/or the amphipathic nature of the 
residues, as long as the biological or immunological activity of DME is retained. For example, 
negatively charged amino acids may include aspartic acid and glutamic acid, and positively charged 
amino acids may include lysine and arginine. Amino acids with uncharged polar side chains having 

20 similar hydrophilicity values may include: asparagiue and glutamine; and serine and threonine. 
Amino acids witii uncharged side chains having similar hydrophilicity values may include: leucine, 
isoleudne, and valine; glycine and alanine; and phenylalanine and tyrosine. 

The terms "annuo acid** and "amino acid sequmce" refer to an oHgopeptide, peptide, 
polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or synthetic 

25 molecules. Where "ainino add sequence'' is redted to r^er to a sequence of a iiatorally occu^ 
protein molecule, "amino add sequence" and like terms are not meant to limit the amino acid 
sequence to die conqilete native amino acid sequence associated with the redted protein molecule^ 

"Anq>lification'' relates lo die production of additional copies of a nucleic add sequence. 
AinpMcation is graieraUy carried out using polyrnerase chain reaction • 

30 known in the art 

The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity 
of DME. Antagonists may include proteins such as antibodies, nucldc acids, carbohydrates, small 
molecules, oar any c^er corqpound or coIr^x>sition which modulates the activity of DME either by 
directiy interactmg with DME or by acting on corr^ionents of the biological pathway in which DME 
35 partidpates. 
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The term *'antibod/* lefos to intact inmumoglobulm molecules as well as to fragments 
thereof, such as Fab, F(ab*)2, and Fv fragments, which are capable of Innding an epitopic determinant 
Antibodies that bhid DMB polypeptides can be prq>aced using intact polypq>tides or using fragnoi^ts 
containing small pqptides of interest as the immunizing antigen. The polypeptide or oligopeptide 
5 used to imnrnniTft an animal (e.g. , a mouse, a rat, or a rabbit) can be derived from the translation of 
RNA, or synthesized chemicaUy, and can be conjugated to a carrier protein if desh^ Qmnnonly 
used carriers that are chemically coiq>led to peptides include bovine serum albumin, thyroglobulin, 
and keyhole linipethemocyaninCECLH). The coupled peptide is thra used to immunize the animal. 
The t»m '^antigenic detBiininanf * refers to that region of a molecule (i.e., an q>itope) that 

10 makes contact with a particular antibody. Whra a protein or a fragment of a protein is used to 
inmnmize a host animal, numerous regions of the protem may induce the production of antibodies 
which bind specifically to antigenic determinants (particular regions or three-dimensional structures 
on the protein). An antigenic determinant may corqpete witii the intact antigen (i.e., the inmumogen 
used to elicit the immune response) for binding to an antibody. 

15 The tma **aptamer^' refers to a nucleic add or oligonucleotide molecule tiiat binds to a 

specific molecular target Aptamers are derived from an in vitro evolutionary process (e.g., SELEX 
(Systematic Evolution of Ligands by Exponential Enrichnmt), described in U.S. Patent No. 
5,270,163), which selects for target-specific aptamer sequences from lar^ combinatfinial libraries. 
Aptamer con^ositions may be double-stranded or single-stranded, and may include 

20 deo:xyribonucleotides, ribonucleotides, nucleotide derivatives, or other nucleotide-like molecules. 
The nucleotide components of an aptamer may have modified sugar groups (e.g., the 2*-OH groiq> of a 
ribonucleotide may be replaced by 2 -F or 2 -NHj), which may in^rove a desired property, e.g., 
redstance to nucleases or longer lifetime in blood. Aptamers may be conjugated to other molecules, 
e.g., a high molecular weight carrier to slow clearance of the aptamer fi^m the circulatory systeno. 

25 Aptamers may be specifically cross-linloed to their cognate ligands, e.g., by photo-activation of a 
cross-linker. (See, e.g., Brody, E.N. and L. Gold (2000) J. BiotechnoL 74:5-13.) 

The term "intramer^* refers to an aptamer which is e^gnressed in vivo . For exan^le, a vaccinia 
virus-based RNA e^n^ession system has been used to express specific RNA aptamers at hig^ levels in 
the cytoplasm of leukocytes (Blind, M. et al. (1999) Proc. Natl Acad. Sci. USA 96:3606-3610). 

30 The term "spieg^Imer*' refers to an aptamer which includes L-DNA, L-RNA, or other left- 

banded nucleotide derivatives or nucleotide-like molecules. Aptamers containing left4ianded 
nucleotides are resistant to degradation by imturally occurring enzymes, which normally act on 
substrates containing ri^t-4ianded nucleotides. 

The term "antisense" refits to any composition capable of base-pairing witii the "saise" 

35 (coding) strand of a specific nucleic acid sequence. Antisense conopositions may include DNA; 
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RNA; peptide nucleic acid (PNA); oligonucleotides having modified backbone linkages such as 
phosphorothioates, methylphosphonates, or braizylphosphonates; oligonucleotides having modified 
sugar groups such as 2 -metho^^ethyl sugars or 2 -methoxyedioxy sugars; or oligonucleotides having 
modified bases such as S-methyl cytosine, 2*-deo3cyuracil, or 7-deaza-2'-deoxyguanosine. Antisense 

5 moleculesnmy be produced by any method including chemicd synthesis or transcrqp Once 
introduced into a cell, the complementary antisense molecule bas&-pairs with a naturally occurring 
nucleic add sequence produced by the cell to form duplexes which block either transcription or 
translation. The designation *^gative" or '^minus" can refer to the antisense strand, and the 
designation *^)ositive" or *^lus" can refer to the sense strand of a referraice DNA molecule. 

10 The term 'Ixiologically active'* refers to a protein having structural, regulatory, or biochemical 

fimctioas of a naturally occuning molecule. likewise, "immunologically active" or '^umnunogemc" 
refers to the c^ability of the natural, recombinant, or synthetic DME, or of any oligopeptide thereof, 
to induce a specific immune response in appropriate animals or cells and to bind with specific 
antibodies. 

15 ''Complementary*' describes the relationship between two single-stranded nucleic acid 

sequences that anneal by base-pairing. For exanqple, 5 -AGT-3* pairs widi its complement, 
3*-TCA-y. 

A ''conqx)sition conq^rising a ffven polynucleotide sequence" and a ''conoposition conqprising 
a given amino add sequence** refer broadly to any composition containing the given polynucleotide 

20 or amino add sequence. Thecoiivositionmay comprise a dry formulation or an aqueous solu^on^ 
Compositions conqirising polynucleotide sequences encoding DME or fragments of DME may be 
employed as hybridization i»R>bes. The probes may be stored in fieeze-dri ed form and may be 
associated witii a stabilizmg ag^ such as a carbohydrate, hi hybridizations, tiie probe may be 
deployed in an aqueous solution ccmtaining salts (e.g., NaQ), det^gents (e.g., sodium dodecyl 

25 sul&te; SDS), and otiier conq>onents (e.g., Denhaidfs solution, dry milk, salmon spemi DNA, etc.). 

"Consensus sequence** refers to a nucleic add sequence which has been subjected to repeated 
DNA sequence analysis to resolve uncalled bases, extended using the XLrPCR kit (Applied 
Biosystems, Fost^ City CA) in the S* and/or tiie 3' direction, and resequoiced, or which has bem 
assembled fiom one or more overlapping cDNA, ECT, or gnomic DNA fragments using a coni^uter 

30 program for fragment assembly, such as the GELVIEW fragment assembly system (GCG, Madison 
WI) or Pbrap (University of Washington, Seattie WA). Some sequences have been both extended and 
assembled to produce the consensus sequence. 

"Conservative amino acid substitutions" are those substitutions that are predicted to least 
interfere with the properties of the original protein, i.e., the structure and especially the function of 

35 the protein is conserved and not significantiy changed by such substitutions. The table below shows 
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ammo acids which may be subsdtutBd for an ori^mal ammo acid in a protein and which are legaided 
as conservative ammo add substitutions. 





OH Pinal l{ Aid fine 
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IS 


Lys 


Arg, Ghi, Ou 




Met 


Leu, Be 




Phe 


IBs. Met, Leu, Trp, Tyr 




Ser 


Cys,Thr 




Thr 


Ser. Val 


20 


Tip 


Phe, Tyr 




Tyr 


His, Phe, Trp 




Val 


He, Leu, Thr 



Conservative anuno acid substitutions generally matutam (a) the structure of the polypeptide 
25 backbone in the area of the substitution, for exanq)le, as a beta sheet or alpha helical conformation, 
(b) the charge or hydrophoMcity of the molecule at the site of the substitution, and/or (c) the bulk of 
the side chain. 

A ^del^on** refers to a change in die amino acid or nucleotide sequmce that results in the 
absence of one or more amino acid residues or nuclec^des. 
30 Hie term "derivative" refers to a chemically modified polynucleotide or polypeptide. 

Chemical modifications of a polynucleotide can include, for example, replacement of hydros by an 
alkyi, acyl, hydroxyl, or amino group. A derivative polynucleotide encodes a polypeptide which 
retains at least one biological or immmiol€)gicalfimction of die natural A^rivative 
polypeptide is oo& modified by glycosylaticm, pegylation, or any shnOar process tfiat retains at least 
35 one Inological or immunological fimctim of the polypeptide firom which it was derived. 

A "detectable labeF refors to a reporter molecule or enzyme that is capable of generating a 
measurable signal and is covalentiy or noncovalentiy joined to a polynucleotide or polypeptide. 

*TMfierential e^qpression" refers to increased or upregulated; or decreased, downregulated, or 
absent gene or protein ^qiression, determined by cona^iaring at least t^ Such 
40 comparisons may be carried out betwe^ for example, a treated and an untreated san;)le, or a 
diseased and a normal sample. 

^'Exon shuffling** refi^ to the recombmatiou of diflS^nt coding regions (exons). Since an 
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exon may leptesent a structural or fimctioQal domain of the encoded pf otein, new proteins may be 
assembled through the novel leassortment of stable substructures, thus allowing accelaation of the 
evolution of new protein functions. 

A '"fragmenf * is a unique portion of DMB or the polynucleotide encodmg DME which is 
5 idratical in sequence to but shorter in length than the parent sequCTce. A fragment may coiipdse up 
to the entire l^agth of the defined sequence, minus one nucleotide/amino add residue. For exanq>le, a 
fragment may conqxrise from 5 to 1000 contiguous nucleotides or ainino add resi^ Afiagmmt 
used as a probe, primor, antigen, therapeutic molecule, or for other purposes, may be at least S, 10, 
IS, 16, 20, 25, 30, 40, SO, 60, 7S, 100, ISO, 2S0 or at least SOO contiguous nucleotides or ammo add 

10 residues in length. Fraginents may be preferi^tiaUy selected from certain regions of a molecu^ For 
exan:q>le, a polypeptide fiagmmt may conqxrise a certain lengtti of contiguous amino acids selected 
fKm the first 250 or 500 amino adds (or first 25 % or 50%) of a polypeptide as shown in a certain 
defined sequence. Qearly these lengths axe exeoiplaiy, and any length that is siQq>Q^ 
specification, including the Sequmce Listing, tables, and figures, may be encompassed by the preset 

15 eihbodiments. 

A fragment of SEQ ID NO:14-26 conpises a region of unique polynucleotide sequence that 
specifically identifies SEQ ID NO: 14-26, for exanq[>le, as distinct from any other sequence in the 
g^ome from which the fragment was obtained. A fragment of SEQ ID NO: 14-26 is usefrd, for 
example, in hybridization and amplification technologies and in analogous methods that distinguish 
20 SEQ ID NO:14-26firomrelated polynucleotide sequences. Thepredselengthof a fragment of SEQ 
ID NO:14-26 and the region of SEQ ID NO:14-26 to wMch the firagmmt corresponds are routinely 
deteiminable by one of onlinary skiU in the art based on the intended pur^ 

AfiagmentofSEQmNO:l-13isaKX)dedbyafiagmmtofSEQmN Afragment 
' of SEQ ID N0:l-13 conqprises a regicm of unique ammo add sequesice that spedfically identifies 
25 SEQIDN0:1-13. Fc^ exanq>le, a firagment of SEQ ID NO:l-13 is usefid as an immunogenic 

for the devdopment of antibodies ttiatspedficaUy recognize SEQ ID NO:l-13. Thepredselengthof 
a fragmi^ of SEQ n> NO:l-13 and the cegicm of SEQ ID NO:l-13 to iprtii^ 
correqKmds are loutindy detenmnable by oue of ordinary sldn in 
purpose for die fragment 

30 A "full Iragth" polynucleotide sequence is one containing at least a translation initiation 

codon (e.g.,inetUonine)foUowed by an op» reading frame and a traiislationtermro^ A 
'full lengdi" polynucleotide sequence encodes a *iidl length" polypeptide sequence. 

''Homology'* refers to sequence similarity or, interchangeably, sequence identity, between 
two or more polynucleotide sequences or two or more polypeptide sequaices. 

35 Hie terms '^percent identity** and identity,'* as applied to polynucleotide sequences, refer 
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to fb& percentage of residue matches between at least two polynucleotide sequences aligned using a 
standardized algorillmL Such an algorithm may insert, in a standardized and rq>roducible way, g^s 
in the sequences being conq>ared in order to optimize aligoinent between two sequences, and 
therefore achieve a more meaningful comparison of the two sequences . 
5 Percent identity between polynucleotide sequences may be determined using the defoult 

parameters of the CLUSTAL V algorithm as incarporated into the ME6ALIGN version 3.12e 
sequence alignment program. This program is part of the LASERGENE software paclcage, a suite of 
molecular biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in 
Higgins, D.G. andPJM. Sharp (1989) CABIOS 5:151-153 and in Higgms, D.G. et aL (1992) CABIOS 
10 8: 189-191. For pairwise alignments of polynucleotide sequences, the default parameters are set as 
follows: Ktuple=2, g^ penalty=^, window^, and ^'diagonals saved"=4. The 'Veighted" residue 
weig^ table is selected as the default Percent identity is rep(»1ed by CLUSTAL V as the ''percent 
similarity** betwem aligned polynucleotide sequences. 

Alternatively, a suite of commonly used and freely available sequence conqKuison algorithms 
15 is jffovided by the National Craiter fin- Biotechnology Mcnmation (NCBI) Basic Local Alignment 
Search Tool (BLAST) (Altschul. SJF. et al. (1990) J. MoL Biol. 215:403^10), which is available 
from several sources, including the NCBI, Bethesda, MD, and on the Internet at 
ht4)://www.ncbijihiLnih.gQv/BLAST/. The BLAST software suite includes various sequence 
analysis programs including 1>lastn,'* that is used to align a known polynucleotide sequence with 
20 otherpolynucleotidesequoticesfromavariety of databases. Also available is a tool called '^LAST 2 
Sequences'' that is used for direct pairwise coiEQmrison of two nucleotide seqi^^ **BLAST2 
Sequences" can be accessed and used interactively at htq[>:y/www.ncbi.n1nLnih.gov/gp!rigbl2JitmL 
The 'ISLAST 2 Sequences'' tool can be used for bodiblastn and bla^ BLAST 
programs are commonly used with gap and oti^ param^ers set to defuilt settings. Fbrexample, to 
25 ccnqpare two nucleotide seqooaces, one may use blastn with the 'IBLAST 2 Sequences" tool Version 
2.0.12 (^pril-21-2(XX)) set at de&ultparam^ears. Su(^ default paiainetas may be, for exaxq[>le: 
Matrix. BLOSUM62 
Reward formatch: 1 
Penalty for mismatch: -2 
30 Open Gcqf: 5 and Extension Gap: 2 penalties 

Gap X drop-off: 50 
Expect: 10 
Word Size: 11 
Filter: on 

35 Percent identity may be measured over the length of an entire defined sequence, for exanq>le. 
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as defined by a particular SEQ n> nuinber, or may be measured over a shorter length, for exanq>le, 
over die length of a fragment taksn from a larger, defined sequence, for mstance, a firagment of at 
least 20, at least 30, at least 40, at least 50, at least 70, at least 1 00, or at least 200 contigaras 
nucleotides. Such Iragths axe exenqilary only, and it is understood that any fragment lengtii 

5 supported by the sequences shown h^rem, in the tables, figures, or Sequence Listing, may be used to 
describe a Iragtfa ov^ which percmtage identity may be measured. 

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 
simUar ainmo add sequeiK^ due to the degeneracy of the ^netic code. It is understood that changes 
in a nucldc add sequmce can be made using this degeneracy to produce multiple nucleic acid 

10 sequences that all encode substantially the same pxotdn. 

The phrases ^"percent identity** and identity," as applied to polypeptide sequ^u:es, refer to 
the percentage of residue matches between at least two polypeptide sequences aligned usiag a 
standardized algoritl&n. Methods ofpolypqptideseque[K:e alignment are well-knowrL Some 
aUgninent methods take into account conservative amino add substitutions Such conservative 

15 substitutions, explained in more detail above, gaaerally preserve the charge andjiydrophofaidty at the 
site of substitution, thus preserving the structure (and therefore function) of the polypeptide. 

Percent identity between polypeptide sequmces may be determined using the default 
parameters of die CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e 
sequence alignment program (described and referenced above). Forpairwisealignmoitsof 

20 polypeptide sequoices using CLUSTAL V, the de&ult parameters are set as follows: Ktuple=l, g^ 
penalty^3, window^, and ^'diagonals saved"=^. The PAM250 matrix is selected as the default 
residue weight table. As with polynucleotide aUgnnients, the pen:ent identity is rq[)or^ 
CLUSTAL V as the '*fexcGBt ^nnladty^ betweoi aligned polypeptide sequence pairs. 

Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise 

25 comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 
2.0.12 (April-21-2000) with blastp set at default paraineters. Sucfadefiudtparamsteismay be,far 
exarqple: 

Matrix: BLOSUM62 

Open Gap: 11 and Extension Gap: 1 penalties 
30 Gapxdrop^: 50 

Expect: 10 
Word Size: 3 
FUter: an 

Perc^ identity may be measured over the l^gth of an entire defined polypq)tide sequence, 
35 for exaQq)le, as defined by a particular SEQ ID number, or may be nmasured over a shorts l^igtfa, for 
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exanq>le, over the length of a firagment taken fiom a laiger, defined polypeptide sequence, for 
instance, a fiagment of at least 1 5, at least 20, at least 30, at least 40, at least SO, at least 70 or at least 
ISO contiguous residues. Such lengths are exeniplary only, and it is understood that any fragment 
length supported by die sequences shown her^ in the tables, figures or Sequence listing, noay be 

S used to describe a Iragth over which p^rc^ita^ identity may be measured. 

'Human artificial chromosomes'* (HACs) are linear microchromosomes which may contain 
DNA sequences of about 6 ld> to 10 Mb in size and which ccmtain all of the elements required for 
chromosome r^lication, segregation and maint^iance. 

Tlie Cenn 'liumanized antibody" refisrs to an antibody molecule in which the amino add 

10 sequence in the non-^tigenbindixig regions has been altered so that die antibody mo 
resenibles ahuman antibody, and still retains its original bindiiig ability. 

'Hybridization" refers to the process by which a polynucleotide strand anneals with a 
con9>lemmtary strand through base pairing under defined hybridization conditions. Specific • 
hybridization is an indication that two nucleic add sequences share a high degree of complementarity. 

IS Specific hybridization complexes form under permissive annealing conditions and remain hybridized 
after die 'Vashing*' stqi(s). The washing step(s) is particulady inqiortant in detennining the 
stringraicy of the hybridization process, with more string^t conditions allowing less non-specific 
biQd]i\g,Le., binding between pairs of nucldc add strands that are not perfectiyiim Permissive 
conditions for annealing of nucldc add sequ^ices are routinely determinable by one of ordinary skill 

20 in the art and may be consistent among hybridization experiments, whereas wash conditicHis may be 
varied among experiments to achieve the desired stringency, and therefore hybridization specifidty. 
Pennissive annealing conditions occur, for exanq>le, at 68°C in the presmce of about 6 x SSC, about 
1% (w/v) SDS, and about 100 figAnl sheared, denatured salmon spennDNA. 

G^ierally» string^cy of hybridization is e^qiressed, in part, with refer^ce to the tenqierature 

25 under which the wash step is carried out Such wash tenq[)eratures axe typically selected to be about 
5^ to TXfC lower than the thermal melting point (T^ for the specific sequeiK:e at a defined ionic 
strength and pH. The T^, is the temperature (under defined iomc strength and pH) at wMch S0% of 
the target sequence hybridizes to a perfectly matched probe. An equation for calculating T„ and 
conditions for nucleic acid hybridization are well known and can be found in Sambrook, J. et al. 

30 (1989) Molecular gpnin^. A Laboratory Manual. 2"* ed., vol. 1-3, Cold Spring I^bcM: Press, 
Plainview NY; specifically see volume 2, chapter 9. 

ffigh stringency conditions for hybridization between polynucleotides of the present 
invention include wash conditions of 68*^0 in the presence of about 0.2 x SSC and about 0. 1 % SDS, 
for 1 hour. Alternatively, ten5)eratures of about 65**C eO'^C, SS^^Q or 42°C may be used. SSC 

3S concentration may be varied fix>m about 0.1 to 2 x SSC, with SDS being present at about 0.1%. 
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Typically, blocldiig reagents are used to block non-specific hybridization. Such blockmg reag^ts 
include, for iiistance, sheared and denatured sahmon sperm DNAa^ Organic 
solvent, such as formanude at a concentration of aboot 35-50% v/v, may also be used under particolar 
circumstances, such as for RNA:DNA hybridizations. Useful variations on these wash conditions 
5 wiUbereadily apparent to those of oidinaiy skin m the art. Hybridization, particularly under hig|h 
string^cy conditions, may be suggestive of evolutionary similarity between the nucleotides. Such 
similarity is strongly mdicalive of a similar role for the nucleotides and their encoded polypepitides. 

The term 'liybridizadon con^ilex'* refers to a complex formed between two nucleic acid 
sequencesby virtue of the formation of hydrogm bonds betwemcoiiqplementary bases. A 
10 l^bridizatim complex may be formed in solution (e.g., Qt or R^t analysis) or formed betwem one 
nucleic acid sequm:e present in solution and another nucleic add sequence immobilized on a solid 
support (e.g., papCT, membranes, filters, chips, pins or glass slides, or any other appropriate substrate 
to which cells or their nucleic acids have been fixed). 

The words 'ins^on" and '^addition'* refa: to chan^ in an amino add or nucleotide 
15 sequmce resulting in the addition of one or more amino acid residues or nucleotides, respectively. 

^Immune response*' can refer to conditions associated with inflammaticm, trauma, imnume 
' disorders, infectious or genetic disease, etc. These conditions can be characterized by egression 
of various factors, e.g., cytokines, chemoldnes, and oth^ signaling molecules, which may affect 
cellular and systemic defense systems. 
20 An "immunogenic fiagment** is a polypq>tide or o]igopq)tide fiagment of DMB which is 

capable of eliciting an imnnme response when introduced into a living organism, for exan^le, a 
mammal. The term "immunQg^c fiagmrat" also includes any polypq>tide or oligopeptide fiagment 
of DME which is usefid in any of the antibody production mettiods disdosed herein or known in tiie 
art 

25 The term ''microarray^ refers to an acrang^m»it of a plurality of polynucleotides, 

polypeptides, or other chemical cQnqx>unds on a substrate. 

The tenns **eleaQimt^ and "'atray elemenf ref to a polynucleotide, pcdyp^tide, car other 
chemical compoimd having a unique and defined position on a microarray . 

The tenn^^modulate'^refeis to a change in the activity of DN^ For exanyle, modulation 
30 may cause an increase or a decrease in protein activity, binding characteristics, or any other 
biological, functional, or immunological properties of DME. 

The phrases "nucldc add" and *'nucleic add sequmce" refer to a nucleotide, oligonucleotide, 
polynucleotide, or any fiagment thereof . These phrases also refer to DNA or RNA of genomic or 
synthetic origin which may be single-stranded or double-stranded and may represrat die sense or the 
35 antisraise strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-like material. 
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"Operably linked" lefi^ to tbe situation in which a fiist nucleic acid sequence is placed in a 
fimctionalielaticinship with a second nucldc ad For instance, a promoter is operably 

linked to a coding sequence if the promoter affects ttie transcription or e^^iession of the coding 
sequence. Opeiably linked DNA sequences nmy be in close proxinuty or contigu^ 

5 necessary to join two protein coding regions, in the same reading fiame. 

'Teptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 
conqurises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of 
amino acid residues ending hi lysine. The tenninal lysine confers solubility to the conqposition. 
FNAs preferentially bind coiiq)lementary single stranded DNA or RNA and stqp transcript 

1 0 elongation, and may be pegylated to extend their lifespan in the cell . 

'Tost-translatiodal modification" of an DME may involve lipidation, glycosylation, 
phosphorylation, acetylation, racemization, proteolytic cleavage, and otiier modificaticms known in 
the art Theseprocessesmay occur synthetically or biochemically. Bioch^cal modifications will 
vary by cell type deprading on the enzymatic milieu of DME. 

15 *T)robe" refers to nucleic acid sequences encoding DME, their conqplements, or fragments 

thereof, which are used to detect identical, allelic or related nucldc acid sequences. Probes are 
isolated oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. 
Typical labels include radioactive isotopes, Ugands, chennluminescent agents, and raazymes. 
"Primers" are short nudeic adds, usually DNA oligonucleotides, which may be annealed to a target 

20 polynucleotide by compleminitarybaseipairing. The primer may thm be extended along the target 
DNA strand by a DNA polymerase enzyme. Primer pairs can be used for amplification (and 
identification) of a nucldc acid sequence, e.g., by the polymerase chain reaction (fCK). 

Ftobes and primers as used in tbe present invention typically con^^ 
nucleotides a known sequence, border to enhance spedfid^,loi:^ger probes and primers 

25 be miployed, such as probes and jmmers ttiat ccMUprise at least 20, 25, 30, 40, SO, 60, 70, 80, 90, 100, 
or at least ISO consecutive nucleotides of die disclosed nudeic add sequences. Ptobes and primers 
may be considerably longer tiian these exanq^les, and it is understood that any length supported by the 
specification, including tbe tables, figures, and Sequence listing, may be used. 

Methods for preparing and uang probes and primers are described in the refisrences, for 

30 exanq>le Sambrook, J. et al. (1989) Molecular q ntimf r A T ^boratorv Manual. 2°^ ed., vol. 1-3, Cold 
Spring Harbor Press, Plainview NY; Ausubel, FM. et aL (1987) Current Protocols m Molecular 
Biology. Greene Publ. Assoc. & Wley-lhtersdences, New York NY; hmis, M. et al. (1990) PGR 
Protocols. A Guide to Methods and Applications. Academic Press, San Diego CA PCRprimerpairs 
can be derived fiom a known sequence, for &Lanq>le, by using computer programs int^ded for that 

3S purpose such as Primer (Version 0.S, 1991, Whitehead Institute for Biomedical Research, Cambridge 
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MA). 

Oligoaucleotides for use as primers ate selected using software known in the art for such 
puipose. For exanq)le,OUGO 4.06 software is useful for die selection of PC^ 
100 nucleotides each, and for ttie analysis of oligonucleotides and larger polynucleotides of up to 

5 5,000 nucleotides ftom an input polynucleotide sequence of up to 32 kilobases. Similar primer 

selection ptpgcams have incorporated additional features for expanded capabilities. For exanqple, the 
FrimOU primer selection program (available to the public fiom the Genome Center at University of 
Texas South West Medical Center, Dallas TX) is capablo of choosing specific primers from 
megabase sequences and is thus useftd for designing primers on a genome-wide scope. The Primer3 

10 primer selection program (available to the public from the Whitehead Institute^MTT Craiter for 
Genome Research, Cambridge MA) allows the user to input a '^mispriming library," in which 
sequences to avoid as primer binding sites are user-specified. PrimerB is usefid, in particular, for the 
selection of oligonucleotides for microatrays. (The source code for the latter two primer selection 
programs may also be obtained fix>m their respective sources and modified to meet the user' s specific 

15 needs.) The PiimeGen program (available to the pubfic from the IJK Human 

Project Resource Cientre, C^ambridge UK) designs primers based on multiple sequence alignments, 
thereby allowipg selection of primers that hybridize to eitter the most conserved or least cons^ed 
regions of aligned nucleic acid sequences. Hence, this program is useful for identification of both 
unique and conserved oligonucleotides and polynucleotide fiagrnraits. The oligonucleotides and 

20 polynucleotide fragments identified by any of the above selection methods are useful in hybridization 
technologies, for exanople, as PCR or sequencing primers, ndcroairay el^m^ts, or specific probes to 
identify fully or partially conq;)lementary polynucleotides in a sample of nucleic acids. Methods of 
oligonucleotide selection are not limited to fliose described above. 

A '^recombinant nucldc acid'' is a sequence that is not natural^ occw^ 

25 that is made by an artificial combination of two or more otherwise separated segments of sequence. 
This artificial combination is often accQnq>lished by diemical synthesis or, more commonly, by the 
artificial manqmlatioii of isolated segments of nucleic acids, e.g., by genetic engineering techniques 
such as those described in SambiooL supra. The term recombiiiant includes nuckic adds that have 
beenalteisdsolelyby addition, substitution, or deletion of a portion of the nucleic a^ Frequmtiy,a 

30 recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter 
sequence. Such a recombinant nucleic acid iiiay be part of a vector that is used, for exam^ 
transform a celL 

Alternatively, such recombinant nucleic adds may be part of a viral vector, e.g., based on a 
vacdnia virus, that could be use to vaccinate a maTmml whesein the recambinant nucleic add is 
35 e^rpressed, inducing a protective inmnmological response in the manmiaL 
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A '^^idatOiry elemenf* lef eis to a nucleic acid sequence usually derived from untranslated 
regions of a and includes enhancers, piomoteis, introns, and 5' and 3' untranslated legions 
(UTRs). Regulatory elements interact with host or viral imsteu^ 
translatioii, or RNA stability. 

S 'Hepoarter molecules" are chemical or biochemical moieties used for labeling a nucleic acid, 

amino add, or antibody. Rq)orter molecules include radionuclides; enzymes; fluorescent, 
chemilummescent, or cfaromogenic agents; substrates; cofactors; inbilutors; magnetic particles; and 
other moieties known in tiie art , 

An '*RNA equivalent," in reference to a DNA sequence, is conqposed of tiie same linear 

1 0 sequence of nucleotides as the reference DNA sequence with the exception that all occurrences of tiie 
nitrpg^ous base thymine are replaced with uracil, and tiie sugar backbone is composed of ribose 
instead of deoxyribose. 

The tmn"sanq)le" is used in its broadest s«ise. A sanq>le suspected of containing DME, 
nucleic adds encoding DME, or fragments tiiereof may conqxrise a bodily fluid; an extract fixnn a 

15 cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic DNA, RNA, or 
cDNA, in solution or bound to a substrate; a tissue; a tissue print; etc. 

The terms '^specific binding" and "specifically Inndingf * refer to that interaction between a 
protem or pqytide and an agonist, an antibody, an antagonist, a small molecule, or any natural or 
synthetic binding conq[X)sition. The interaction is dependent upon tiie presence of a particular 

20 structure of the protdn, e.g., the antigenic determinant or q>itqpe, recognized by the Innding 
molecule. Forexanq)le,if an antibody is specific for q[>itope "A," the presence of a polypeptide 
conq>iising the epitcqpe A, or the presraoe of firee unlabeled A, in a reaction 
and the antibody wQl reduce the amount of labeled A that binds to the antibody. 

Hie teaxk "substantially purified" refers to nucldc add or amino add sequences that are 

25 removed from their natural environmi^t and are isolated or separated, and are at least 60% free, 

preferaUy at least 75% free, and most preferably at least 90% free from other ccH^pooents with which 
they are naturally associated. 

A "substitution" refers to the replacement of one or more amino add re^dues or nudeotides 
by diffenrat amino add residues nucleotides, respectively. 

30 "Substrate" refers to any suitable rigjid or semi-rig^d support including membranes, filters, 

chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
microparticles and c^pillanes. The substrate can have a vari^ of surface forms, such as wells, 
trenches, pins, channels and pores, to which polynucleotides or polypq>tides are bound. 

A "transcript image" or "expression profile" ref^s to the collective pattern of gene 

35 expression by a particular cell type or tissue under g^ven conditions at a given time. 
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'Tnmsfoixnation" describes a process by which exogenous DNA is introduced into a lec^ient 
cell. Transfomiation n^y occur under natural or artificial conditions according to various methods 
well known in the art, and may rely on any known method for the insertion of foreign nucleic acid 
sequences into a prokaryotic or eukaryotic host cell. Tlie method for transfomiation is selected based 
5 on the type of host cell being transformed and may include* but is not limited to, bacteriophage or 
viral infection, electroporation, heat shock, lipofection, and particle bombardment The term 
''transformed cells** includes stably transformed cells in which die inserted DNA is capable of 
replication either as an autonomously replicating plasmid or as part of the host chromosome, as well 
as transi^itly transformed ceUs which express the inserted DNA or RNA for limited periods of time. 

10 A "transg^c organism," as used herein, is any organism, including but not limited to 

animals and plants, in which one or more of the cells of the organism contains heterologous nucldc 
add introduced by way of human intervention, such as by transgenic techniques well known in the 
art The nucleic add is introduced into the cell, direcdy or indirectly by intn3duc^ 
of the cell, by way of deliberate graetic manipulation, such as by microinjection or by infection with 

15 a recombinant virus. The teem genetic manipulation does not include classical cross-breeding, or n| 
vitro fertilization, but ratha: is directed to the introduction of a recombinant DNA molecule. The 
transgenic organisms ccmtemplated in accosdance widi the present invention include bacteria, 
cyanobacteria, fungi, plants and animals. The isolated DNA of the present invention can be 
introduced into the host by mediods known in the art, for example infection, transfectira, 

20 transformaticm or transconjugation. Techniques for transferring the DNA of the present invootion 
into such organisms are widely known and provided in references such as Sambrook et aL (1989), 
supra. 

A **varianf of a particular nucleic add sequrace is defined as a nucleic add sequence having 
at least 40% sequence idraitity to the particular nucldc acid sequmce over a certain length of one of 

25 the nucleic add sequences uaboig blastn with the "BLAST 2 Sequences" tool Version 2.0.9 (May-O?*- 
1999) s^ at default parameters. Such a pair of nucleic adds may show, for exanqple, at least 50%, at 
least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 
93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% or greater 
sequence id^itity ov^ a certain defined length. A variant may be described as, for exanq>le, an 

30 "allelic" (as defined above), "splice," "species," or "polymorphic" variant A splice variant may have 
significant identity to a reference molecule, but will generally have a greater or lesser number of 
polynucleotides due to alternate splicing of exons during mRNA processing. The corresponding 
polyp^tide may possess additional functional domains or lack domains that are pres^ in die 
reference molecule. Spedes variants are polynucleotide sequences that vary from one spedes to 

35 another. The resulting polypeptides wiU generaUy have significant amino acid identity relative to 
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each Other. ApolymoipMc variantisavanationmthe 

between mdividuals of a giveai species. Polymoiphic variants also may encompass "single nucleotide 
polymorphisms" (SNPs) in whidi the polynucleotide sequence varies by one nucleotide base. The 
presence of SNPs may be indicative of, for example, a certain population, a disease state, or a 

5 propensity for a disease state. 

A 'Variant" of a particular polypeptide sequence is defined as a polypeptide sequence having 
at least 40% sequence identity to the particular polypeptide sequence over a certam length of one of 
the polypeptide sequences using blas^ with the ''BLAST 2 Sequences" tool Version 2.0.9 (May-07- 
1999) set at de&ult parameters. Such a pak of polypeptides may show, for example, at least 50%, at 

10 least 60%, at least 70%, at least 80%, at least 90%, at least 91%, at least 92%, at least 93%, at least 
94%, at least 9S%, at least 96%, at least 97%, at least 98%, or at least 99% or greater sequence 
identity over a certain defined length of one of the polypeptides. 

THEINVENTION 

15 The invention is based cm the discov^ of new human drug metaboliang enzymes (DME), 

die polynucleotides encoding DME, and the use of these compositions for the diagnosis, treatment, ex 
inevention of autoimmune^nflammatCMy, cell proli£ra:ative, neurological, developmental, endocrine, 
eye, metabolic, and gastnnntesthial disorders, including liver disoacd^. 

Table 1 summarizes ttienoinenclature for the fuUleiigthpolynuclro 

20 sequences of the invration. EaiA polynucleotide and its cocrespoiiding polypeptide are correlated to 
^gle hicyte project identification numiber (Incyte Project ID). Each polypeptide sequence is denoted 
by both a polypq[>tide sequence idmtificaticm number (Polypeptide SEQ ID NO:) and an Incyte 
polypq^de sequence nuniberCEiicytePcdypept^ Eadi polynucleotide sequence is 

doioted by bodi a polynucleotide sequence identification numbor (Polynucleotide SEQ ID NO:) and 

25 an hicyte polynucleotide consensus sequence number (hicyte Polynucleotide ID) as shown. 

Table 2 shows sequences with homology to the polypeptides of the inveaticn as identified by 
BLAST analysis against the GmBank protrin (g/sapepQ database. Columns 1 and 2 show the 
polypeptide sequence identification number (Polypeptide SEQ ID NO:) and the corresponding hicyte 
polyp^tidesequrace number (hicyte Polypqjtide ID) for polypq^ ColumnS 

30 shows the (j^ank identification number (GenBank ID NO:) of the nearest GenBank homolog. 

(Column 4 shows the probability scores for the matches between each polypqitide and its homolog(s). 
Column S shows the annotation of the (3ctB ank homologs along witti relevant citations wherei 
^licable, all of which are expressly incorporated by reference herem. 

Table 3 shows various stnictural features of the polypeptides of the invmtion. Columns 1 

35 and 2 show tiie polypeptide sequence idoitification number (SEQ ID NO:) and the cocresponding 
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iQcyte polypeptide sequence number (Encyte Polypeptide ID) for each polypeptide of tiie invoition. 
Cohmm 3 shows the number of amino acid residues in each polypeptide. Column 4 shows potential 
phosphorylation sites, and column 5 shows potential glycosylation sites, as detamined by the 
MOTIFS program of the QCO sequence analysis software package (Genetics Conq>uter Group, 

5 Madison WI). Q>lunm 6 shows annno add residues comprising signature sequences, domaii^ 
motifs. Cblumn 7 shows analytical methods for pn)teinstnictureAfi^ 
searchable databases to which the analytical methods were ^plied. 

Togetiier, Tables 2 and 3 smnmarize the properties of polypq>tides of the invention, and these 
properties establish that the claimed polypq>tides are drug nietabolizm SEQIDNOtlis 

10 81% identical to rat cytochrome P4S0 (GenBank ID g203890) as determined by the Basic Local 
Alignment Search Tod (BLAST). (See Tsible 2.) The BLAST probability score is 6.9e-22S, which 
indicates the probability of obtaming the observed polypeptide sequence aligmnent by chance. SEQ 
ID N0:1 is also 59% identical to rabbit cytochrome P450 2A10 (GenBank ID gI65433) and 60% 
identical to rabbit cytochrome P4S0 2A1 1 (GenBank ID gl6S43S), as detmnined by BLAST 

15 analysis, witii probability scores of 1.6e-163 and 2.0e-163, respectively. SEQ ID NO. l also contains 
a cytochrome P4S0 domain as determined by searching for statistically significant matches in the 
Udden Markov modd(HMM)-basedFFAM database of coiiserved protein fami^ (See 
Table 3.) Data fixmi BLIMPS, MOTIFS, and PROHLESCAN analyses provide further corroborative 
evidence tiiat SEQ ID NO: 1 is a cytochrome P450. 

20 SEQ ID NO:2 is 64% identical to human glutathione S-transferase omega (G^ank ID 

ggl32762), as detenmned by BLAST analysis, with a probability score of 1.6e-83. SEQ ID NO:2 
also cont^ms a glutaduone S-transfesrase domain as determined by searching for statistically 
significant matches in the hidden Markov model (HMM>-based FFAM database of conserved protein 
fianQy domains. Data frcxn BLIMPS analysis provides furthra- c(Hrob(B:ative evidence that SEQ ID 

25 NO:2 is a glutathic»ie S-transfeiase. 

SEQ ID NO:3 is 76% idmtical to human heparan sul&te N-deacetylaseAN-sulfotransferase 3 
(GenBank ID g4322247), as determined by BLAST analysis, wiOi a negligible piobalrility score. 
SEQ m NO:3 is also 78% identical to two other famnan hqiaran sul&te N-4effl 
sulfotransferases (GoiBanklDs glQ36797 and g976372), based on BLAST analysis, also witii 

30 negligible probability scores. 

SEQ ID NO:4 is 58% id^tical to human N-acetylgalactosaminyItransf(n:ase (GenBank ID 
g2121220) as detennmed by the Basic Local Alignment Se^ (See Table 2.) Hie 

BLAST probability score is 8.7e-172, which indicates the probability of obtaining tiie observed 
polypeptide sequence aligmnent by chance. SEQIDNO:4isalsoS7% and46% id^tical totwo 

35 other hmnan N-acetylgalactosammyltransferases (GenBank IDs gl934912 and g5834600, 
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respectively), based on BLAST analysis, with BLAST probability scores of 7.2e-168 and L4e-118, 
respectively. SEQ n> NO:4 ako contahis a gjycosyltransfoase active site domam 
searching for statistically significant matches m the hidden Maikov model (HMM)-based FPAM 
database of conserved piotein family domams. (See Table 3.) Data fiom MOTIFS analysis provides 

S farther corroborative evidmce that SEQ ID N0:4 is a N-acetylgalactosaminyltransfanase. 

SEQ ID NO:5 shares 98% overall identity to rat nemoligin 2 (GenBank ID gll4S789) and 
100% idmtity, over 550 amino acid residues, to a human neuroligm 3 isofonn (GenBank ID 
g7960135), based on BLAST analysis. In both cases, tiie BLAST probability scores are negligible. 
The polynucleotide eaicoding tiie polypq>tide of SEQ ID N0:5 G.e., SEQ ID NO: 18) and tiie 

10 polynucleotide encoding gll45789 (i.e., gll45788) share 78% overall identity and 92% identity 
withm the respective coding regicms, based on BLAST analysis. Neuroligins contam 
carboxylesteras&-like domains although they are catalytically inactive. SEQ ID NO:S contains a 
caiboxylestranase-like domain as determined by searching for statistically significant matches in the 
hidden Markov model (HMM)-based FPAM database of conserved pn^in family domains. (See 

15 Tables.) Data from MC)TIFS,FROF0^CAN, and BLIMPS analysis pn>vid^ 
evidence that SEQ ID NO:2 is a neuroligin. 

SEQ ID NO:6 is 51% identical to human UDP-GalNAcrpolypeptide N-acetylgalactosaminyl 
transferase (GenB ank ID g97 146 1 ) as determined by the Basic Local Alignmmt Search Tool 
(BLAST). (See Table 2.) The BLAST probalnUty score is 3.0e-146, which indicates the probability 

20 of obtaining the obsegrved polypeptide sequence aligning by chance. SEQ ID NO:6 also contains a 
gly cosyl transferase domain as determined by searching for statistically significant matches in the 
Mdden Maitov model (HMM)-based FPAM database of conserved protein family domains. (See 
Tables.) 

SEQ ID NO:7 is 87% identical to murine cartxmyl reductase (GenBank ID gl049108), a 
25 short-<jiain alcohol dehydrogenase, as d^enmned by BLAST a^ The BLAST probability score 
isl3e-128. SEQ ID NO:7 is also 85% identical to five human carbonyl reductases ((BenBank IDs 
gl79978, gl81037, £3702679, g0693616, and |f7768722), as dfttmnmed by BLAST analysis, aU witii 
probability scores of 5.2e-126. SEQ ID NO:7 also coiitains a shoit-dbain alcohol ddiyd^^ 
domain as determined by searching for statistically significant matches in tfie hidden Markov model 
30 (HNIM)-based FPAM database of conserved protein fiunilydor^^ Data ficom BLIMPS, MOTIFS, 
and PROFDLESCAN analyses provide further cotroboradve evidence that SEQ ID NO :7 is a short- 
chain alcohol dehydrogenase. 

SEQ ID NO:8 is 49% identical to Oiinese hamster b^-l.S-N-acetylghicosanunyl transferase 
(GenBank ID gl698601), as det^rmmed by BLAST analysis, witti a probatulity score of l.le-169. 
35 SEQ ID NO:8 is also 51% identical to a human beta-l,6-N-acetylgluco5aminyl transferase (GraiBank 
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ID g4S45222), as detennined by BLAST analysis, with a probability score of 2.0e-168. 

SEQ ID NO:9 is 66% identical to an N-acetyltransfearase from Pseudomonas aemp iTinsa 
(GenBank ID g9946340), as detennined by BLAST analysis, with a probability score of S.8&-S6. 
SEQ ID NO:9 is also 35% identical to anunine (GenBank ID g3095O4) and a human (GenBank ID 

5 gl 103904) N-acetyltransferase, as detemodned by BLAST analysis, with BLAST probability scores of 
2.0e-21 and 6.9e-21, respectively. SEQ ID NO:9 also contains an acetyltransferase domain as 
detecmmed by searching for statistically significant matches in the hidden Markov model (HMM)- 
based PFAM database of conserved protein family domaios. Data from BLIMPS analysis provide 
finrtiier cortoborative evidence that SEQ ID NO:9 is an acetyltransferase. 

10 SEQ ID NO: 10 is 49% identical (over S2S anuno add residues) to a hamster carboxylesterase 

precursor (GenBank ID g2641986) as det^mined by the Basic Local Alignment Search Tool 
(BLAST). (See Table 2.) The BLAST probability score is L9e-130, which mdicates the probability 
of obtaining the obs^ed polypeptide sequence dignment by chance. SEQ ID NO:10 is also 48% 
identical to a rat carboxylesterase precursor (over 524 amino acid residues; GenBank ID g2641992) 

15 and 47% identical to human carboxylesterase hCE'2 (over 525 amino add residues; GenBank ID 
gl407780), based on BLAST analysis, with BLAST probability scores of 3.1e-128 and 2.6e-124, 
respectively. SEQ ID NO:10 also contains a carboxylesterase domain as determined by searching for 
statistically significant matches in the hidden Markov model (HMM)-based PFAM database of 
conserved protdn family domains. (See Table 3.) Data from BLIMPS, MOTIFS, and 

20 PROFILESCAN analyses provide fiirtiier corroborative evidence that SEQ ID NO:10 is a 
carboxylesterase. 

SEQ ID NO: 1 1 is 62% identical (over 108 amino add residues) to a CuAi-superoxide 
dismutase from Caulobacter crescemtns (GenBank ID pl44283V as determined by BLAST amdysis, 
with a probability score of 4. 1&-32. SEQ ID NO: 11 also contains a Cu/Zn-superoxidedisnnitase 

25 domain as determined by searching f<x statistically significant matches in the hiddCT Markov model 
(EIMM)4>ased PFAM database of conserved protein fairnly domains. (See Table 3.) Datafrom 
BLIMPS, MOTIFS, and PROFILESCAN analyses provide furtiiBr corroborative evidence tiiat SEQ 
ID NOrll is a Ca/2^Q-siq)eroxide disnmtase. 

SEQ ID NO: 12 is 86% id^itical, from residue Ml to residue F222, to four human glutathione 

30 S-transferases (GenBank IDs g306809, g306810, g259141, and g242749) as determined by the Basic 
Local Alignment Search Tool (BLAST). (See Table 2.) The BLAST probability scores are 6.3e-98, 
which indicate the probability of obtaining the observed polypeptide sequence alignment by chance. 
SEQ ID NO: 12 also contains a glutathione S-transferases domain as determined by searching for 
statistically significant matches in tiie hidden Markov model (HMM>based PFAM database of 

35 conserved protein family domains. (See Table 3.) Data from BLIMPS analysis provides fiirtfaer 
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carroboiative evidence that SEQ ID N0:12 is a glutathione S-transferase. 

SEQ ID N0:13 is 55% identical, from residue R2 to residue V273, to Escherichia coli 
iiiaiphme-6-dehydrogpnase (GenBank ID gl736409) as det^mmed by BLAST analysis. The BLAST 
probability score is 8.0&-7S. SEQ ID NO:2 also contains an aldo/keto reductase domain as 
5 determined by searching for statistically significant matches in the hidden Markov model (HMM)- 
based PFAM database of conserved pnitein fenuly domains. Data from BLIMPS and 
FROFBLESCAN analyses provide further corroborative evidence tiiat SEQ ID NO: 13 is an aldo/keto 
reductase. The algorithms and parameters for the analysis of SEQ ID NO:l-13 are descri 
7. 

10 As shown in Table 4, the full length polynucleotide sequraices of the preset mvention were 

assraibied using cDNA sequmces or coding (exon) sequences derived fit>m genomic DNA, or any 
comtunation of these two types of sequences. Column 1 lists the polynucleotide seqaeaco 
identification number (Polynucleotide SEQ ID NO:), the corresponding Bicyte polynucleotide 
consensus sequence number (bicyte ID) for each polynucleotide of the invention, and the length of 

15 each polynucleotide sequence in basepairs. Column 2 shows the nucleotide start (S^) and stop (3^) 
positions of the cDNA and/or genomic sequences used to assemble the full lengdi polynucleotide 
sequences of tl» invention, and of fragments of the polynucleotide sequences which are useful, for 
■ * exaniple^ in hybridization or ampMcation technologies diat identify SEQ ID NO: 14-26 or that 
distmguish betwem SEQ ID NO: 14-26 and related polynucleotide sequraces. 

20 The polynucleotide fragmmts described in Column 2 ofTable 4 may refer specificaU 

exanq>le, to Incyte cDNAs (ferived from tissue-specific cDNA libraries or fitxm pooled cDNA 
libraries. Alternatively, the polynucleotide fragments desciil)ed in colunm 2 may refer to GenBank 
cDNAs or ESTs wtncb attributed to the assembly of the fiill length polynuclec^de sequences. Jn 
addition, the polynucleotide fragments described in column 2 may identify sequences derived from 

25 the ENSEMBL (The Sang^ Centre, Cambridge, UK) database (L e , tfiose sequences including the 
designaticm'IBNSrO. Altmiatively, the polynudeotidefragn«its described in colunm 
derived firon Ae NCBI ReCSeq Nucleotide SequoQce Rbco^ 

includmg the designatiOT "NM" or '*NTO or the NCBI RefSeq Protein Sequence Records (i e. , those 
sequences includiiig the designation 'TIF'). Alternatively, the polynucleotide fragments described in 

30 column 2 may refer to assemblages of both cDNA and Genscan-piedicted exons brought together by 
an "exon stitching*' algorithm. For exan]ple, a polynucleotide sequence identified as 
FLJKXXXXXJfjJfsjnnnrrjfU^^ represmts a "stitched" sequence m which XXXXXX is tite 
identification number of the cluster of sequences to which the algorithm was applied, and YYYYY is 
the number of the prediction generated by the algorithm, and Njj^^ if present, represent specific 

35 exons that may have been manually edited during analysis (See Exanyle V). Alternatively, tiie 
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polynucleotide fragooents in column 2 may lefer to assemblages of exons brought tog^er by an 
**exon-stietching^ algoriflmL For exan^le, a polynucleotide sequence identified as 
FlJOaOOOLgAAAAA^BBBB^lJf is a "stretched" sequence, with XXXXXX being the Incyte 
project identification numb», gMAAA being the GenBank identification number of the human 
gnomic sequence to which the "exon-stretching" algorithm was applied, gfiBBBB being the 
G^iBank identification number or NCBI RefSeq identification number of the nearest GenBank 
prc^in homolog, and N leferring to specific exons (See Example V). In instances where a RefSeq 
sequence was used as a protein homolog for the "exon-stretching** algorithm, a RefSeq ideotifi^ 
(denoted by ''NM," "NP," cnr "NT") may be used m place of tiie GenBank identifier (ie., gBBBBS), 

Alt^natively, a prefix identifies conq>onent sequences that were hand-edited, predicted ficom 
genomic DNA sequences, or derived firom a combination of sequrace analysis methods. The 
following Table lists exanq>les of conq[)Qnent sequence prefixes and conesponding sequence analysis 
mediods associated with the prefixes (see Example IV and Example V). 



Prefix 


Type of analysis and/or examples of programs 


GNN, GFG. 
ENST 


Exon prediction from gnomic sequences using, for exanq>le, 
GENSCAN (Stanford Univ^ity, CA, USA) or FGENES 
(Computer Genomics Group, The Sanger Centre, Cambridge, UK). 


GBI 


Hand-edited analysis of genomic sequences. 


EL 


Stitched €r stretched genomic sequences (see BKample V). 


INCY 


Full length transcript and exon predicticm fix>m mapping of EST 
sequences to the genome. Genomic location and EST composition 
data are combined to predict the excns and lesoItiDg tcanscnqpt 



In some cases, Incyte cDNA coverage redundant with the sequence covera^ shown in Table 
4 was obtained to confirm the final consensus polynucleotide sequence, but the relevant hicyte cDNA 
identification numbers are not shown. 

Table S shows the tqpres^itative cJ)NA Ubraries for those fiiU lengdi poly^ 
sequences which were assraibled using hicyte cDNA sequences. The representative cDNA library is 
the hicyte cDNA library which is muost frequoitly represented by the Incyte cDNA sequences which 
were used to assemble and confirm the above polynucleotide sequences. The tissues and vectors 
which were used to construct the cDNA libraries shown in Table S are described in Table 6. 

The inv^tion also enconQ>asses DMB variants. A preferred DME variant is one which has at 
least about 80%, or alternatively at least about 90%, or even at least about 95% amino add sequence 
identity to the DME amino acid sequence, and which contains at least one functional or structural 
characteristic of DME. 
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The inventioii also encompasses polynuclec^des wMch encode DME. In a particular 
embodiment, tbe invention encompasses a polynucleotide sequence conpising a sequence selected 
fix>m the gtoiq> consisting of SEQ ID NO: 14-26, wMch encodes DME The polynucleotide seqii^o^ 
of SEQ ID NO: 14-26, as presented in tbe Sequence listing, embrace the equivalrat RNA sequraces, 

5 wherein occunences of the nitipgraous base tlqinine are replaced with uracil, and flie sugar backbone 
is conoposed of ribose instead of deo?^bose. 

The inventira also encoiqasses a variant of a polynucleotide sequence encoding DME. la 
particular, such a variant polynucleotide sequence will have at least about 70%, or alternatively at 
least about 85%, or even at least about 9S% polynucleotide sequence idraitity to the polynucleotide 

10 sequence encoding DMR Aparticularaspectof the invention encompasses a variant of a 
polynucleotide sequence conqnising a sequence selected from the group consisting of SEQ ID 
NO: 14-26 which has at least about 70%, or alternatively at least about 85%, or even at least about 
95% polynucleotide sequence identity to a nucldc acid sequence selected from the group consisting 
of SEQ ID NO: 14-26. Any one of the polynucleotide variants described above can encode an anuno 

15 acid sequence which contains at least one functional or structural characteristic of DME. 

In addition, or in the altemative, a polynucleotide variant of the invention is a splice variant 
of a polynucleotide sequence encoding DME. A splice variant may have portions v^ch have 
significant sequoice identity to tiie polynucleotide sequence encoding DME, but win g^erally have a 
greater or lesser number of polynucleotides due to additicsos or deletions of blocks of sequence arising 

20 frcmdteniate splicing of exons during niRNAprocessin^^ A splice variant may have less than about 
70%, or alternatively less than about 60%, or alternatively less than about 50% polynucleotide 
sequence id^itity to the polynudeotide sequence encoding DME over its entire l^ogdi; howev^, 
porticMis of the splice variant will have at least about 70%, or alternatively at least about 85%, or 
alternatively at least about 95%, or alternatively 100% polynucleotide sequence idraitity to porticms of 

25 the polynucleotide sequsQce encoding DM& Any one of the splice variants described above can 
encode an ainino add sequence which contains at least one fimctional or str^ 
DMR 

It will be q)precaated by those skilled in tiie art that as a resnU 
genetic code, a multitude of polynucleotide sequences encoding DME, some bearing minimal 

30 similarity to the polynucleotide sequ^ces of any known and naturally occurring gene, may be 
produced Thus, the invention contemplates each and every possible variation of polynucleotide 
sequence that could be made by selecting combinations based on possible codon choices. Iliese 
combinations are made in accordance with the standard triplet genetic code as applied to the 
polynucleotide sequence of naturally occurring DME, and all such variations are to be considered as 

35 being specifically disdosed. 
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Although nucleotide sequences which encode DME and its variants are generally capable of 
l^bridizing to the nucleotide sequence of the naturally occurring DME under appropriately selected 
conditions of string^ocy , it may be advantageous to produce nucleoti^ sequences encoding DME or 
its derivatives possessing a substantially different codon usagp, e.g., inclusion of non-naturally 
5 occurring codoDS. Codonsnoaybeselectedtoinczeasedierateat which expression of the peptide 
occurs in a particular prolkaryotic or eukaiyotic host in accordance with the £nequency with which 
particular codons are udUzed by the host Other reasons for substantially altering the nucleotide 
sequence encoding DME and its derivatives without altering the eiK^oded amino add sequmces 
include die production of RNA transcripts having more desirable properties , such as a greater 

10 half-life, than transcripts produced fix»n die naturally occurring sequence. 

The invention also mcoIK^>asses production of DNA sequences which encode DME and 
DMB derivatives, or fiagments thereof, eatixely by synthetic chemistry. After production, the 
synthetic sequence may be inserted into aiiy of the many available e:qnressiQn vectors and cell 
systems using reag^ts well known in the art Moreover.syntheticchemistry may beusedto 

15 introduce mutations into a sequoice encoding DME or any fragment thmof . 

Also enconqpassed by the invention are polynucleotide sequences that are capable of 
hybridizing to the claimed polynucleotide sequ^ces, and, in particular, to those shown in SEQ ID 
NO: 14-26 and fiagments thereof undem various conditions of stringency. (See, e.g., Wahl, GM. and 
SX. Berger (1987) Methods EnzymoL 152:399-407; Kimmel, AJL (1987) Methods EnrymoL 

20 152:507-51 1.) Hybridization ccmditions, including annealing and wash conditions, are described in 
"Definitions." 

Methods for DNA sequencing are well known in the art and may be used to practice any of 
the embodiments of the invention. The mediods may eiiq>]oy such enzymes as the Klenow firagmrat 
of DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymmse (Applied 
25 Biosystems), tfiermostable T7 polymerase (Amersham Hiarmacia Biotech, Piscataway NJ), or 

c<HnhinaticHis of polymerases and proofreading excmucleases such as tiiose f ouikI in die ELONGASE 
an^Mcationsfystem (Life Technologies, (Saitfieirsbur^ PkefearaUy, sequence preparation is 
automated with machines sudi as tibe MICROI^ 2200 liquU 

FTC200 ttiermal cycler (MT Research, Watetown MA) and ABI CATALYST 800 tiiermal cycler 
30 (Applied Biosystems). Sequencing is then carried out usmg either the ABI 373 or 377 DNA 
sequencing system (Applied Biosystems), die MEGAB ACE 1000 DNA sequencing system 
(Molecular Dynamics, Smmyvale CA), or other systems known in the art The resulting sequences 
are analyzed using a variety of algorithms which are weU known in the art (See, e.g., Ausubel, F.M. 
(1997) ShOTt Protocols in Molecular Biology. John Wiley & Sons, New York NY, unit 7.7; Meyers, 
35 ILA. (1995) VToiflrnlar Biolo^ and Rint echnologv^ Wiley VCH, New Yoik NY, pp. 856-853.) 
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The nucleic acid sequences encoding DMB may be ext^ided utilizing a partial nucleotide 
sequence and enq>loying various PCR-based methods known in the art to detect upstream sequences, 
such as promote and regulatoiyelem^ts. Forexa]xq>le,onemethodwhidi may be employed, 
restriction-site PGR, uses universal and nested primers to anq>Iify unknown sequence from genomic 

5 DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PGR Methods Applic. 2:318-322.) 
Another method, inverse PGR, uses primers that extrad in divergent directions to anq>lify unknown 
sequence from a circularized tenq)late. The template is d^ved from restriction fragments conqprising 
a known genomic locus and surrounding sequraces. (See, e.g., Triglia, T. et al. (1988) Nucleic Adds 
Res. 16:8186.) A third method, c^ture PCR, involves PGR amplification of DNA fragments 

10 adjacmt to known sequences in human and yeast artificial chromosome DNA. (See, e.g., Lag^trom, 
M.etal. (1991) PGR Methods .^)plic. 1:111-119.) In this method, multiple restriction enzyme 
dig^tions and ligations may be used to insert an en^eered double-stranded sequence into a region 
of unknown sequence befcMB performing PGR. Other methods which may be used to retrieve 
unknown sequences are known in the art (See, e.g., Parker, J J>. et al. (1991) Nucleic Adds Res. 

15 19:3055-3060). Additionally, one may use PGR, nested primiers, and PROMOTEEUPINDER libraries 
(Qontedi, Palo Alto CA) to walk genomic DNA. This procedure avoids the need to screen libraries 
and is useful in finding mtron/exon junctions. For all PCR-based methods, primers may be designed 
using conmnercially available software, such as OLIGO 4.06 primer analysis software (National 
Biosciences, Plymoutii MN) or another appropriate program, to be about 22 to 30 nucleotides in 

20 length, to have a GC content of about 50% or more, and to anneal to the template at temperatures of 
about 68^G to 72°C 

When screening for full length cDNAs, it is preferable to use libraries that have been 
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include 
sequences containing the 5* regions of genes, are preferable for situations in which an oligo d(T) 

25 library does not yield a full-length cDNA. Genomic libraries may be useful for extension of sequence 
into 5' non-4ranscribed regulatory regions. 

GapiUary electrophoresis systems which are comm^rdaUy available may be used to analyze 
the size or confirm tb& nucleotide sequence of sequencing or PGR products, la particular, c^illaiy 
sequencing may enq>loy flowable polymers for eiectrophoretic separation, four different nucleotide- 

30 specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 
emitted wavelengths. Ou^ut/light intensity may be converted to electrical signal using appropriate 
software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Applied Biosystems), and the entire 
process from loading of satiq)les to coiiq)ut^ analysis and electronic data display roay be conq>uter 
controlled GapiUary electrophoresis is especially preferable for sequencing small DNA fragments 

35 which may be present in limited amounts in a particular sample. 
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Iq anotber easbodiment of the inventicm, polynucleotide sequences or firagments thereof 
which encode DME may be cloned in lecombinant DNA nx>lecules fliat diiect expiesdan of DME, or 
fragments or functional eqmvd«(itstbei^ Duetothemherentdegraieracy 
of the genetic code, oth^ DNA sequraces which encode substantially the same or a functionally 

5 equivalent amino acid sequence may be produced and used to express DME. 

The nucleotide sequences of die present invention can be engineered using methods generally 
known in the art in radar to alter DME^coding sequraces for a variety of purposes including, but 
not limited to, modification of the cloning, processing, and/or expression of the gene product. DNA 
shufQiog by random fragmentation and PCR reassembly of gene fragnmts and synthetic 

10 oligonucleotides may be used to engineer the nucleotide sequences. For example, oligonucleotide- 
mediated site-directed mutagpiesis may be used to introduce mutations that create new restriction 
sites, alter glycosylation patterns, change codon preference, produce splice variants, and so fortihu 

The nucleotides of the j^cesent invention may be subjected to DNA shufOing techniques such 
as MOIJBCULARBREEDING Q^faxygen Inc., Santa C3am CA; described in U.S. Patent Na 

15 5,837,458; Chang, G-C et aL (1999) Nat BiotecfanoL 17:793-797; Christians, RC et al. (1999) Nat 
Biotechnol. 17:259-264; and Crameri A. et al. (1996) Nat Biotechnol. 14:315-319) to alter or 
improve tiie biological properties of DME, such as its biological or enzymatic activity or its ability to 
bind to odier molecules or compounds. DNAshufQingisaprocessby which a library of gene 
variants is produced ugng PCR-mediated recQmiMnation of gpne ftagments- The library is dien 

20 subjected to selection or screening procedures that identify those gene variants with the desired 
properties. Thesepreferred variants rnay then be pooled and further subjected to rec^ 
DNA shuffling and selection/screening. Thus, genetic diversity is created tiaoug^ "artificial" 
breeding and nq[ndmd[ecular evolution. Fcsr example, fragments of a ang^ gene containing random 
point mutaticHis may be recombined, scre^ied, and then resfaufOed until the desired properties are * 

25 optimized. AIteniativdy,fiagniBntsof a given gene may be recombioed with fr^^ 

homologous genes in the same gene family, either from tiie same or different species, thereby 
maximizing the genetic diversity of nudtqde naturally oocurriiig ^nes in a directed and controllable 
manner. 

In another embodiment, sequraices encoding DME may be synthesized, in whole or in part, 
30 using chemical methods well known in the art (See, e.g., Caruthers, M JEL et al. (1980) Nucleic Acids 

Symp. Ser. 7:215-223; and Horn, T. et at (1980) Nucleic Acids Symp, Ser. 7:225-232.) 

Alteriiatively,DMEitself or a fragmraittiiereofinay be synthesized using chemicd For 

example, p^tide synthesis can be performed using various sohition-f^hase or solid-phase techniques. 

(See, e.g., Crei^bton, T. (1984) Protdns> Structures and Molecular Piropaties. WH Freeman, New 
35 York NY, pp. 55-60; and Roberge, J.Y. et al. (1995) Science 269:202-204.) Automated syntiiesis 
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may be achieved using the ABI 431A peptide synthesizer (Applied Biosystems). Additionally, the 
amino add sequrace of DME, or any part dieieof, may be altered during direct synthesis and/or 
combined with sequences finom odier proteins, or any part thereof, to produce a variant polypeptide or 
a polypeptide having a seqnrace of a naturally occurring polypeptide. 

5 The peptide may be substantially purified by preparative high p«f ormance liquid 

chromatography, (See, e.g.. Chira; R.M. and RZ. Regnier (1990) Methods Bazymol. 182:392-421.) 
The composition of the synthetic peptides may be confirmed by amino acid analysis or by 
sequencmg. (See, e.g.,Crdght(ni,suBia, pp. 28-530 

In order to ex|xress a biologically active DMB, the nucleotide sequences encoding DME or 

10 dmvatives thereof may be inserted into an appropriate e^qxression vector, Le., a vector which contains 
the necessary elements for transcrq>tional and translational control of the inserted coding sequence in 
a suitable host These elemraits include regulatory sequences, such as enhances, constitutive and 
inducible promoters, and S' and 3' untranslated regions in the vector and in polynucleotide sequraces 
encodingDME. Such elemrats may vary in their strengdi and specificity. Specific initiation signals 

15 may also be used to achieve mozeefBdent translation of sequences encoding DME Such signals 
include the ATG initiation codon and adjacent sequences, e.g. tiie Kozak sequence. In cases where 
sequences encoding DMB and its initiation codon and upstream regulatocy sequences are mserted into 
the appropriate e^qniession vector, no additional transcripdmal or translaticaial control signals may be 
needed. However, in cases where only coding sequence, or a fragroent thereof, is mser^ 

20 translational control signals including an in-frame ATG initiation codon should be provided by tiie 
vector. Bxogsncm translational elements and initiatian codons may be of various origins, both 
natural and synthetic. Theeffidency of egqxressicm may be enhance^ by the indnsion of enhancers 
sqyprppriate far the particular host ceU system u^ (See, e.g.,Scharf,D.etaL (1994) Results ProbL 
Ctell Dififa. 20:125-162.) 

25 Methods which are weU known to those sidlled in the art inay be used to con^ 

vectors containing sequences ftnmHmg DME and appropriate transcrq^onal and translational control 
.deuKnts. These methods include in vitro recombinant DNA technifines, syntfa^ic techniques, and m 
vivogep^ieccHnbinatim. (See, ^g., Sambmoii J. ^ al. f 1989> Molecular Qqim g. A Lahcaatnrv 
Manual Cold Spring Harbor Plress, Plainview NY, ch. 4, 8, and 16-17; Ausubd, FM. et aL (1995) 

30 Current Protocols in Molecular Biolo^. John Wiley & Sons, New YoricNY, ch. 9, 13, and 16.) 

A variety of expression vectoi/host systrais may be utilized to contain and express sequences 
CTCodingDME. These include, but are not lirmted to, nncro(»:ganisms such as bacte^ 
with recombinant bacteriophage, plasmid, or cosmidDNA expression vectors; yeast transformed with 
yeast expression vectors; insect cell systems infected with viral expression vectoxs (e.g., baculoviius); 

35 plant cell systems transformed witii vkal expression vectors (e.g., cauliflower mosaic virus, CaMV, 
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or tobacco mosaic virus, TMV) or witfi bacterial expression vectors (e-g.. Ti or pBR322 plasmids); or 
animal cell syst^ns. (See, &g., Samfarook, siipra: Ausubel, supra: Van Heeke, G. and SM. Schuster 
(1989) I Biol. Chem. 264:5503-5509; Engelhard, RK. d aL (1994) Proc. Natl. Acad. ScL USA 
91:3224-3227; Sandig, V. et aL (1996) Hum. Gene Ther, 7:1937-1945; Takamatsu, N. (1987) BMBO 

5 J. 6:307-311; The McGraw IBll Yearbook of Science and Technology (1992) McGraw Hill, New 
York NY, pp. 191-196; Logan, J. and T. Shenk (1984) Proc. Natl. Acad. Sci. USA 81:3655-3659; and 
Harrington, J J. et al. (1997) Nat GcwL 15:345-355.) Expression vectors derived from retroviruses, 
adoDioviruses, or herpes or vaccinia viruses, or from various bacterial plasmids, may be used for 
deUveryofnucleotidesequeaces to the targeted organ, tissue, or ceUpc^ (See,e.g.,I>i 

10 Nicola, M. et al. (1998) Cancer Gen. Ther. 5(6):350-356; Yu, M. et aL (1993) Proc. Nad. Acad. ScL 
USA 90(13):6340-6344; BuUer, ILM. et al. (1985) Nature 317(6040):813-815; McGregor, DP. et al. 
(1994) MoL Immunol. 31(3):219-226; and Vcama, LM. and N. Somia (1997) Nature 389:239-242.) 
The mvention is not limited by the host cell enq>loy ed. 

hi bacterial systems, a nunab^ of cloning and e^qpression vect(»rs may be sdected depending 

15 upon the use intended for polynucleotide sequences encoding DME. For exanqile, routine cloning, 
subcloning, and propagation of polynucleotide sequences encoding DME can be achieved using a 
multifunctional Rcoli vector such as H3LUESCRIPT (Stratagene, La Jolla CA) cat PSPORTl 
plasndd (Life Technologies). Ligatira of sequences encoding DME into the vector's multiple cloning 
site disnqpts the lacZ gpne, aQowing a colorimetric screening procedure for identificaticm of 

20 traiisformed bacteria coiitainingrecoriibinant molecules, hi addition, these vectors may be useful for 
jg vitro transcription, dideoxy sequencing, single strand rescue with helper phag^, and creation of 
nested ddetioiis in the cloned sequence. (See, e.g.. Van Heefae, G. and S.M. Schuster (1989) J. BioL 
ChmL 264:5503-5509.) When large quantities of DME are needed, e.g. for the production of 
antibodies, vectors lirtiichdireahig^ level esqxresdonofDME may be used. Fen* example, vectms 

25 containing tiie strong, indudble SP6 or T7 bacteriophage promoter may be used. 

Yeast e^qoression systems may be used for productim of DM^ A nuniberrf vectors 
containing constitutive or inducible promoters, such as a^dia faclcHr, alcohol oxidase, and PGH 
pffmiotejs.maybeusedintheyeast Saccha rom Ycescerevisiae orPicW^ pagitftria- In adffition, sudi 
vectors direct either the secretion or intraceOular retention of expressed protdnis and ooable 

30 integration offoreign sequences into the host g^orne for stable propagation. (See, e.g., Ausubel, 
1995, supra: Bitter, GA. et al. (1987) Methods EnzymoL 153:516-544; and Scorer, CA. et al. (1994) 
Bionrechnology 12:181-184.) 

Plant systems may also be used for esqnression of DME. Transcription of sequences encoding 
DME may be driven by viral promote, e.g., the 35S and 19S promoters of CaMV used alone or in 

35 comhmation witii the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO J. 
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6:307-311). Alternatively, plant promoters such as the small subimit of KUBISCO or heat shock 
promoters may be used. (See, e.g., Coruzzd, G. et al. (19S4) EMBO J. 3: 1671-1680; Broglie, R. et al. 
(1984) Science 224:838-843; and Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105.) 
These constructs can be introduced into plant cells by direct DNA transformation or 

5 pathogen-mediated transfection. (See, e.g„ The McGraw Hill Yearbook of Science and Technology 
(1992) McQraw Hill, New YorkNY, pp. 191-196.) 

Tti tnaTntnalinn rftik, a miTnhftr ftf viral-hased eitpressinii systems may be utilized, fiacases 
where an adenovirus is used as an expression vector, sequences ^coding DME may be ligated into an 
adeaovirus transcription/lranslation con^lex consisting of the late promofm' aikl tripartitB leader 

10 sequence. Insertion in a non-«ssential El or E3 region of the viral genome may be used to obtain 
infective virus which expresses DME in host cells. (See, e.g., Logan, J. and T. Shenk (1984) Proc. 
Natl. Acad. Sd. USA 81:3655-3659.) In addition, transcription enhancers, such as the Rous sarcoma 
vims (RS V) enhancer, may be used to increase e:;qMression in mammalian host cells. SV40arEBV- 
based vectors may also be used for high-level protein expressim. 

15 Human artificial chromosomes (HACs) may also be employed to deliv^ larg^ fragments of 

DNA than can be contained in and e^qpressed from a plasmid. HACsof about6kbto IGMbare 
cmstructed and delivered via conventional deliv^ methods (liposomes, polycationic amino 
polymers, or vesicles) for therapeutic purposes. (See, e.g., Haningtcm, J J. et aL (1997) Nat Genet 
15:345-355.) 

20 For long teaa production of recombinant protems in mammalian systems, stable expression 

ofDME in cell lines is preferred. For exai)q>le, sequences encoding DMB can be transformed into 
cell lines using expiessi(Hi vectors which may ccmtain viral origfais of replication and/or endogenous 
exi»essicmelra»its and a selectable marker gene on the same or on a separate Followmgdie 
introductiQn of the vectcr, cells may be allowed to grow for abcmt 1 to 2 days in enridied media 

25 before being switched to selective media. Hie purpose of the selectable marioer is to confer resistance 
to a selective ag^nt, and its presence allows growth and lecoveiy of cells which successfully express 
the introduced sequences. Resistant clones of stably transformed cdls may be propagated usit^g 
tissue culture techniques q)proixiate to the ceJl type, 

Anynuinberof sdtection systems noay be used to recover transformed ceU These 

30 include, but are not linuted to, the herpes siaq>lex virus thymidine kinase and adenine 

phosphoribosyltransferase g^es, for use in tk' and apf cells, respectively. (See, e.g., Wigler, M et 
al. (1977) Cell 11:223-232; Lowy, L et aL (1980) CteU 22:817-823.) Also, antimetaboUte. antibiotic, 
or herbicide resistance can be used as the basis for selection. For example, confers resistance to 
methotrexate; neo confecs resistance to the aminoglycosides neomycin and G418; and als aDdpat 

35 confer lesistaiice to cUorsnlfuron and phosphinotridn acetyltrai^^ (See,e.g., 
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Wigler, M. et al. (1980) Ploc. NatL Acad. Sd. USA 77:3567-3570; Colbeie^iai:^. F. et al. (1981) 
J. Mol. Biol. 150:1-14.) Additional selectable genes have been described, e.g., trpB and hisDj which 
alter cellular leqnixements for metabolites. (See, e.g., Hartman, S.C and R.C Mulligan (1988) Ptoc. 
Nad. Acad. Sd. USA 85:8047-8051.) Visible markers, e.g., anthocyanins, green fluorescent proteins 
5 (OPP; Qontech), fi glucuronidase and its substrate B-glucuronide, or luciferase and its substrate 
luciferinmay beused. These iiuurkers can be used not oidy to identify transfornmts, but al^ 
quantify the amount of transient or stable protein esxptGSSioa attributable to a specific vector system. 
(See, e.g., Rhodes, CA. (1995) Methods Mol. Biol. 55:121-131.) 

Although the presence/absmce of marker gpne expression suggests that the gene of interest is 

10 also pies^ the presence and e^qiression of the gene inay need to be coofirn)^ For exa^^>le, if the 
seguraice encoding DME is inserted within a marker gene sequence, transformed cells containing 
sequ^cesencodingDMEcanbeidentifiedby the absoace of marker gene Alternatively, a 

marker g^ can be placed in tandem with a sequence encoding DME under the control ^ 
promoter. E^qnession of the maikor gene in respcmse to induction or selection usually indicates 

15 e^qnression of the tandem g^e as well. 

In general, host cells that ccmtain the nucleic acid . sequence encoding DME and that express 
DMEmaybeidentifiedby a variety of procedures known to those of skill in the art These 
procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR 
amplification, and protein bioassay or immunoassay techniques which include membrane, solution, or 

20 chip based technologies for the detection and/or quantification of nucleic acid or protein sequences. 

Immunological methods for detecting and measuring die eqnression of DME using either 
spedfic polyclonal or monoclcmal antibodies are known in the art Bxanq>les of such techniques 
include CTzym&4inked immunosoibCTt assays (EOS As), radicnmmunoassays (RIAs), and 
fluorescence actrvated cell sorting (FACS). A two-site, monocloiial-tesed imniunoassay utilizing 

25 manoclcmal antibodies reactive to two non-interfering qriitqpes on DME is preferred, but a 

conqtetitive binding assay may be en^loyed. These and other assays are well known in the art (See, 
e-p., Hanyton> R. et aL (1990) Serriopical Methods, a T ^iwntnrY Mimnftl , Sr Piaiil MN. 

Sect IV; Coligan, J£. el aL (1997) Current Ptotpcols in fammnologv, Greene Pub. Associates and 
Wiley-hiterscience, New Yoric NY; and Pound, J.D. (1998) fmnmnnrh endcal Protocols, Humana 

30 Ftess,TotowaNJ.) 

A wide variety of labels and conjugation tedmiques are known by those skilled in the art and 
may be used in various nucleic acid and amino acid assays. Means for producing labeled 
hybridization or PCR probes for detecting sequences related to polynucleotides encoding DME 
include oligolabeling, nick translation, md-labeling, or PCR amplification using a labeled nucleotide. 

35 Alternatively, the sequences encoding DME, or any ftagments th»eof^ may be cloned into a vector 
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for the production of an mRNA probe. Such vectors are known in the art, are connnerciaUyavailabte^ 
and may be used to syndiesize SNA probes m vitro by addition of an apprqxriate RNA polym^ase 
such as T7,T3, or SP6 and labeled nucleotides. Tliese procedures may be conducted using a variety 
of commercially available kits, such as those provided by Amersham Pharmacia Biotech, Promega 
5 (Madison WQ, and US Biochemical. Suitable reporter molecules or labels which may be used for 
ease of detecticm include radionuclides, enzymes, fluorescent, chemilummescrat, or chromogenic 
agents, as well as substrates, cofactors, inhibitors, magnetic particles, and the like. 

Host cells transformed with nucleotide sequences encoding DMB may be cultured under 
conditions suitable for the ^qpiession and recovery of the protein firom cell culture. Tlie protem 

10 produced by a transformed cell may be secreted or retained intraceUularly depending on (he sequence 
and/or the vector used. As wiU be understood by those of sldU in the art, e^^nession vectors 
containing polynucleotides which encode DMB may be designed to contain signal sequences which 
direct secretion of DMB through a prokaryotic or eukaryotic cell membrane. 

In addition, a host cell strain may be chosen for its ability to modulate e^qnession of the 

15 inserted sequences or to process the e;q>ressed protein the desued fashion. SuchmodiBcationsof 
the polypeptide include, but are not lunited to, acetylation, carboxylation, glycosylation, 
phosphorylation, lipidation, and acylatioiL Post-temslaticmal processing which cleaves a '"prepro" or 
"pro" form of the protein may also be used to specify protein targeting, folding, and/or activity. 
Diffoc^ host cells which have specific cellular machinery and characteristic mechanisms for 

20 post^ranslational activities (e.g., CHO, HeLa, MDCK, HEK293, and WB8) are available fit>m the 
American Type Culture Collection ( ATCC, Manassas V A) and may be chosen to ensure the coaect 
modification and processing of the foreign protein. 

hi anodier embodiment of the invention, natural, modified, or recombniant nucleic acid 
sequences encoding DMB may be ligated to a heterologpus sequrace resulting in translation of a 

25 fusion protein in any of the aformirationed host systeoQS. For example, a chimeric DMB protein 
containing a h^erologous moiety tiiat can be recognized by a commercially available antibo^ may 
fknUtate ttie screening of pq>tidelibotaries for inMfait^ Heterologous proteni and 

peptide moieties may also facilitate purificatim of fusion proteins using commerdaDy available 
afiSnity matrices. Such nKneties include, but are not limited to, glutathione S-transf»ase (GST), 

30 maltose binding protein (MBP), thioredoxin (TDt), calmodulin binding peptide (GBP), FLAG, 
c-n^Cy and hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification of their 
cognate fusion proteins on immobilized glutathione, maltose, phenylarsine oxide, calmodulin, and 
metal-chelate resins, respectively. FLAG, c-mycy and hemagglutinin (EIA) enable immimoaffinity 
purification of fusion proteins using commmially available monoclonal and polyclcml antibodies 

35 that specifically recognize these epitope tags. A fusion protein may also be engiueered to contain a 
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proteolytic cleavage site located betwem Ae DMB encoding sequence and the het^logous protein 
sequence, so that DME may be cleaved away from the heterologous moiety followmg purification. 
Methods for fusion protein ejqnession and purification are discussed in Ausubel (199S, supra, ch. 10). 
A variety of commercially available kits may also be used to fadlitale expression and purification of 
5 fusion proteins. 

hi a further embodiment of the invention, syndiesis of radiolabeled DME may be achieved in 
vitro using the TNT rabbit reticulocyte lysate or wheat geim extract system (Ptomega). These 
systems coi^le transcription and translation of protein-coding sequences operably associated with the 
T7,T3, or SF6 promoters. lYamlation takes place in the presence of a radiolabeled amino acid 

10 precursor, far example ^^S-methionine. 

DME of the preset invention or fiagmiraits tixrateof may be used to screen for compounds that 
specifically bind to DME. At least one and up to a plurality of test compounds nmy be screened for 
specific binding to DME. Examples of test conq^ounds include antibodies, oligonucleotides, proteins 
(e.g., receptors), or small molecules. 

15 In one embodimrat, the confound thus identified is closely related to the natural ligand of 

DME, e.g. , a ligand or fragment thereof, a natural substrate, a structural or functional mimetic, or a 
natural bindmg partner. (See, e.g., Coligan, JB. et al. (1991) Current Prot nmlg tn TTnmimnlnffY 1(2): 
Chapt^S.) Siinilariy, the cQnqK)und can be closely related to die iiatural receptor 
bmds, or to at least a fragment of the receptor, e.g., the Ugand binding site. Ineidi^case, the 

20 conqxiund can be rationally designed using known techniques. In one embodiment, soe^iing for 
these conq>ound5 involves producing appropriate cells which ejqness DME, either as a secreted 
protein or on the ceU membrane. Preferred cells include cells from mammals, yeast, Drosophila, or 
E. colL Cblls expressing DME or ceU membrane fractions which contain DME are tiien contacted 
with a test compound and binding, strmnlatiaa, or inbihition of activity of dtfaerDMB or the 

25 conq[x>und is analyzed. 

An assay may dmpty test bindiiig of a test compound to tfaepolypqitide, wherein Ubading is 
delected by a flnoiq[»horc, radioisotope, enzyiDB conjugate, or otha Fcrexanple, 
' tiie assay inay coiipjse the steps of combiiiing at least on^ 

solutim or afiBxed to a solid sq^port, and d^ecting die binding of DMB to the con^und. 
30 Alternatively, die assay may detect or measure binding of a test compound in the presence of a 

labeled conqpetitor. Additionally, the assay may be carried out using cell-free preparations, chemical 
libraries, or natural product nciixtures, and the test compound(s) may be free in solution or afBxed to a 
solid support. 

DME of the present invration or firagmmits thereof may be used to screen for compounds that 
35 modulatBtfaeactivity of DMB. Such cornpoundsinayindude agonists, antagonists, or partial or 
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inverse agonists. In one enibodiniBnt, an assay is perfonned under 

activity, wherdn DME is c(»nbined with at least one test compound, and the activity of DME in the 
presence of a test compound is conq>aied with the activity of DME in the absence of the test 
compound. Achangeintheactivity of DME in die presence of die test compound is indicative of a 

5 compound that modulates the activity of DME. Alternatively, a test con^und is combined with an 
invitco or cell-fiee system conq[nising DME under conditions suitable for DME activity, and the 
assay is performed. In either of these assays, a test caaspoimd which nnodulates the activity of DME 
may do so indirectiy and need not come in direct contact with the test confound. At least one and up 
to a plurality of test conqiounds may be screened. 

10 hi another embodinKxit, polynucleotides encodii^ DME or their mammalian homolcgs may 

be 'Ifflocked out* * in an animal model system using homologous recombinaticm in enihryonic stem 
(BS}cells. Such techniques are weU known in the art and axe useful for the genenition of ani^ 
models of human disease. (See, e.g., U.S. Patent No. 5,175,383 and U.S. Patent No. 5,767,337.) For 
exaniple, mouse ES cells, such as the mouse 129/SvJ cell line, axe derived fiom the early mouse 

15 embryo and grown in culture. The ES cells are transformed with a vector ccmtaining die g^e of 

interest disrupted by a marker g^ne, e.g., the neomycm phosphotransferase gqae (neo; Capecchi, MJR. 
(1989) SciCTce 244:1288-1292). The vector integinites into the correspondu^ region of the host 
genome by homologous xeccHnbination. Alt^natively, homologous recombination takes place using 
' die Cr&-loxP system to knockout a gene of interest in a tissue- or developmental stage-specific 

20 manna (Marth, J J>. (1996) Clin. Invest 97: 1999-20Q2; Wagna, K.U. et al. (1997) Nucleic Acids 
Res. 25:4323-4330). Transformed ES cells are identified and microinjected into mouse cell 
blastocysts such as tiiose from the C57BL/6 mouse strain. Tlie blastocysts are surgically transferred 
to pseudopregnant dams, and the resulting chimeric progeny are g^otyped and bred to produce 
hetotozygous or homozygous strains. Transg^c animals thus generated may be tested widi potential 

25 therapeutic or toxic agents. 

Polynucleotides encoding DME may also be manipvdated in vitro in ES cells derived from 
human blastocysts. Ihnnan ES cells have the potocttial to differentiate into at least serrate cell 
Kncages inchiding endoderm, mesodmn, and cctodmnal cell types. These cell lineages dififecentiate 
into, for example, neural cells, hoooatopoietic lineages, and cardiomyocytes (Thomson, J.A. el aL 

30 (1998) Science 282:1145-1147). 

Polynucleotides mcoding DME can also be used to create '*knockin** humanized animals 
(pigs) or transgenic animals (znice or rats) to ncKidel human disease. With knockin technology, a 
region of a polynucleotide «oux>ding DME is injected into ammal ES cells, and die injected sequ^ice 
integrates into the animal cell genome. Transfomied cells are injected into blastulae, and the 

35 blastulae are implanted as described above. Transgenic progeny or inbred lines are studied and 
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treated with potential phammceutical ag^ts to obtain tnf onnatian on trcatment of a human disease. 
Alternatively, a manmial infaied to ov e r e xpiess DMB^ e.g., by secreting DME in its milk, may also 
serve as a convenient source of that protein (Janne, J. et al. (1998) BiotechnoL Anna. Rev. 4:55*74). 
THERAPEUTICS 

5 Qiemical and structural similarity, e.g., in the context of sequences and motife, exists 

between regions of DME and drug metabolizing enzymes. la addition, exasaplcs of tissues 
expressing DME can be found in Table 6. Therefore, DME appears to play a role in 
autoinmmnefinflaminatQiy, cell proliferative, neurological, developmental, endocrine, eye, metabolic, 
and gastrointestinal disoiders, including liver disQiders. In the treatment of disoid^ associated with 

10 increased DME expressioo or activity, it is desirable to decrease tfie expression or activity of DME 
In the treatment of disorders associated with decreased DME expression or activity, it is dedrable to 
increase the e3q>ression or activity of DME. 

Therefore, in one embodimait, DME or a fragment or derivative tiiereof may be administered 
to a subject to treat or prevent a disordo: associated with decreased expression or activity of DME. 

15 Examples of such disordm include, but are not limited to, an autoimmune/inflammatory disorder, 
such as acquired timnunodeficiracy syndrome (AIDS), Addison's disease, adult respiratory distress 
syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, asthma, atherosclerosis, 
autoimmune hemolytic anama, autoimmune thyroiditis, autoimmune polyendocrinopathy- 
candidiasis-ectodeEmal dystrophy (AFECED), bronchitis, cholecystitis, contact dennatitis, Crohn's 

20 disease, atc^c dermatitis, domatcmyositis, diabetes mellitus, enq>hysema, episodic lymphopCTia 
witii lynqxhocytotoxins, erythroblastosis fetalis, erydiema nodosum, atrophic gastritis, 
gtomeralooephritis, Gooc^astuie's syndrome, gont. Graves' disease, Hashimoto's thyroiditis, 
hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, niyocardial or 
pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, polymyositis, psoria^ Rioter' s 

2S syndrome, riieumatoid arthritis, sderodenna, SjC^grcn's syndrome, systemic an^hylaxis, systemic 
lupus erythematosus, systemic sclerosis, flnombocytopenic purpura, ulcerative colitis, uveitis, Werner 
syndrome, complications of cancer, faemodiatysis, and extracofpareal drcnlatiosi, viral, bacterial, 
fungal, parasitic, protozoal, and hehnintiiic infection^ 

as actinic keratosis, arteriosclerosis, atlierosclax)sis, bursitis, cinfaosis, hepatitis, mixed connective 
30 tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal hemog^lolnnuria, polycythemia vera, 
psoriasis, primary thrombocythemia, and cancers including adenocarcinoma, leukerma, lymphoma, 
melanoma, myeloma, sarcmna, teratocaicinoma, and, in particular, cancers of die adrenal gland, 
bladder, bone, bone marrow, brain, breast, cervix, gall bladder, gangjia, gastrointestinal tract, heart, 
kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary gjbnds, skin, sple^ 
35 testis, thymus, diyroid, and uterus; a nemolpgical disanl^ such as qvilepsy, ischemic cerebrovascular 
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disease, stroke, cerebral neoplasnis, Alzhdxner's disease. Pick's disease, Hundngtoa's disease, 
dementia, Paridboson's disease and othor extrapyiamidal disorders, amyotrophic lateral sclerosis and 
other motor nearon disorders, progressive neural muscular atrophy, retinitis pigtmraitosa, hoeditaiy 
ataxias, multiple sclerosis and other demyelinating diseases, bacterial and viral mraingitis, brain 

5 abscess, subdural enq>yema, epidural abscess, suppurative intracranial fbrombophlebitis, myelitis and 
radiculitis, viral cratral nervous system disease, prion diseases including kuru, Creutzfeldt-Jakob 
disease, and G^stmann-Straussler-Scheinker syndrome, fiital fiunilial insomnia, nutritional and 
metabolic diseases of the n^ous system, neurofibromatosis, tuberous sclerosis, cexebelloietmal 
hemangioblastoinatosis, encephalotrigeminal syndrome, mental retardation and other developmental 

1 0 disorders of the central nervous system including Down syndrome, cerebral palsy, neuroskeletal 
disorders, autcnomic nervous system disorders, cranial nerve disorders, spinal cord diseases, 
muscular dystrophy and other neuromuscular disorders, peripheral nervous Systran disciders, 
deimatomyositis and polymyositis, inherited, metabofic, endocrine, and toxic myopathies, 
myastibtenia gravis, periodic paralysis, mental disorders including mood, anxiety, and schizophrenic 

15 disorders, seasonal affective disorder (SAD), akathesia, amnesia, catatonia, diabetic neurppatiiy, 
tardive dyskmesia, dystonias, parancnd psychoses, posOierpetic neuralgia, Tourette's disorto, 
progressive supranuclear pal^,corticobasaldegmeratiQn,aiid£ami^ a 
developmmtal disorder, such as renal tubular acidosis, anemia, Cushing's syndrome, achondroplastic 
dwarfism, Duchenne and Becker muscular dystrophy, epnqpsy , gonadal dys^esis, WAGR 

20 syndrome (Wilms' tumor, aniridia, g^tourinary abnormalities, and mental retardation), Smith- 
Magenis syndrome, myelodysplastic syndrome, hereditary mucoepithelial dysplasia, hereditary 
keratodetmas, hereditary neuropatiues such as Charcot-Mari&Tooth disease and neurofibromatosis, 
hypotbyioidism, hydroce|dialus, seizure disorders such as ^ndenham's chorea and cerebral palsy, 
spina Infida, anencq^ialy , cianiorachisdusis, ccng^tal glaucoma, cataract, and saisorineural 

25 hearing loss; an endocrine disorder, such as disordCTS of the hypothalamus and pituitary resulting 
from lesions such as primary brain tumors, adoKHnas, infarcticm associated with pr^;naiK:y, • 
hypophysectomy , aneurysms, vascular malfonnations, thrombosis, infecticms, immunological 
disordeats, and complications due to head trauma; disorders associated with hypc^utuitarism including 
hypogonadism, Shedian syidrome, diabetes insipidns, KaUman's disease, Hand-^hnll^-Qiristian 

30 disease, Letterer-Siwe disease, sarcoidosis, errq>ty sella syndrome, and dwarfism; disorders associated 
with hyperpituitarism including acromegaly, giantism, and syndrome of inappropriate antidiuretic 
hormone (ADH) secretion (SIADH) often caused by benign adenoma; disorders associated widi 
hypothyroidism including goiter, myxedema, acute thyroiditis associated with bacterial infection, 
subacute thyroiditis associated with viral infection, autommume thyroiditis (Hashimoto's disease), 

35 and cretinism; disorders associated with hypothyroidism including thyrotoxicosis and its various 
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forms, Giave*s disease, piedbial myxedema, toxic multinodular goiter, tt&yioid caidnoma, and 
Plummer*s disease; disorders associated with hyperparatbyroidism including Conn disease (chronic 
hyp^calemia); pancreatic disorders such as Type lor Type n diabetes melfitns and associated 
complications; disorders associated witii the adrenals such as hypeqplasaa, carcinoma, or adenoma of 
5 the adrenal cortex, hypertension associated witii alkalosis, amyloidosis, hypokalemia, Cushing's 
disease, Liddle's syndrome, and Amold-Healy-Gordon syndrome, pheochromocytoma tumors, and 
Addison's disease; disorders associated with gonadal steroid hormones such as: in women, abncxmal 
prolactin production, infertility, endometriosis, p^turhadons of the menstmal cycle, polycystic 
ovarian disease, hyperprolactinemia, isolated gonadotropin deficiency, amencnrfaea, galactcxrhea, 

10 hennaphroditisin, hirsutism and virilization, breast cancer, and, in post-menopausal women, 

osteoporosis; and, in men, Leydig cell defid^y, male climacteric phase, and gonninal cell aplasia, 
hypergonadal disorders associated with Leydig cell tumors, androgen resistance associated with 
abs^ice of androgen receptors, syndrome of 5 a-reductase, and gynecomastia; an eye disorder, such 
as conjunctivitis, keratoconjunctivitis sicca, kmtitis, episcleritis, iritis, posterior uvdtis, glaucoma, 

IS amaurosis fugax, isdiemic optic neuropathy, optic neuritis, Leber's hereditary optic neuropathy, toxic 
cfpdc neuropathy, vitreous detachment, retinal detachment, cataract, macular degoneration, central 
serous chorioretinopathy, retinitis pigmentosa, melanoma of the choroid, retrobulbar tumor, and 
chiasmal tumor; a metabolic disorder, such as Addison's disease, cerebrotendinous xanthomatosis, 
coii^genital adrraial hyperplasia, coumaiin resistance, cystic fibrosis, diab^es, fatty hq^atocirrhosis, 

20 fructose-l,6-dqphosphatase defidoicy, galactosemia, goiter, glucagonoma, glycog^ storage diseases, 
hereditary fiructose intolerance, hyperadrenalism, hypoadrenalism, hyperparathyroidism, 
hypoparatiiyroidismi, hypercholestBatolemia, hyperthyroidism, hypoglycemia, hypothyroidism, 
l]9pedq>idemia, hypedqpemia, lq>id myopatiiies, lipodystrc^faies, lysosomal stora^ diseases, Menkes 
syndrwne, occipital hom syndrome, mannosidosis, neuraminidase deficiency, obesity, pentosuria 

25 phenylketonuria, pseudovitaminD-defidoobcy rickets; hyi^^ 

pos^Nibescent cerebellar ataxia, and tyrosinemia, and a gastrointestinal disorder, such as dysphagia, 
peptic esophagitis, esoj^iageal spasm, esc^rihagpal stricture, esojAagealcaKanama, dyspepsia, 
indigestion, gastritis, gastric carcinoma, anoiexia, nausea, emesis, gastroparesis, antral or pyloric 
edema, abdominal angina, pyrosis, g^stroentmtis, intestinal obstructim, infecticms of the intestinal 

30 tract, peptic ulcer, cholelithiasis, cholecystitis, cholestasis, pancreatitis, pancreatic carcinoma, biliary 
tract disease, hepatitis, hyperbilirubinemia, heiieditary hyperbilirubinemia, cirrhosis, passive 
congestion of the liver, hepatoma, infectious colitis, ulcerative colitis, ulcerative proctitis, Crohn's 
disease, Whipple's disease, Mallory-Weiss syndrome, colonic carcinoma, colonic obstruction, 
irritable bowel syndrome, short bowel syndrome, diarrhea, constipation, gastrointestinal hemorrhage, 

35 acquired immunodeficiracy syndrome (AIDS) ^teropathy, jaundice, hqmtic encephalopathy. 
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hepatoienal syndrome, hepatic steatosis, heuoochromatosis, ^^scm's disease, alpha^aiititrypsm 
deficiency, Reye's syndrome, primaiy sclerosing cholangitis, ]iver in&iction, portal vein obstmctiQa 
and thrombosis, centrilobnlar necrosis, peHosis hepatis, hepatic vein tfaromibosis, venoHXxlusive 
disease, preeclampsia, eclanq>sia, acute &tty liver of pregnancy, intrahepatic cholestasis of 
5 pregnancy, and hqiatic tumors including nodular hyperplasias, adenomas, and carcinomas. 

Jn another embodiment, a vector capable of e^qpiessing DME or a fiagment or derivative 
thereof may be administered to a subject to treat or prevent a disorder associated witti decreased 
expression or activity of DME including, but not limited to, those described above. 

In a further embodiment, a con^osition conqxrising a substantially purified DME m 
10 conjunctim with a suitable pharmaceutical carrier may be administered to a subject to treat or prevent 
a disorder associated with decreased eTcpiession or activity of DME including, but not limited to, 
those provided above. 

In stfll another CTibodiment, an agonist which modulates the activity of DME may be 
admhustered to a subject to treat or prevent a disorder associated with decreased expression or 
15 activity of DME including, but not limited to, those listed above. 

In a furtfxer embodimmt, an antagonist of DME may be administated to a subject to treat or 
prev»t a discHder associated with increased expres^on or activity of DME. Examples of such 
disorders include, but are not limited to, those autoimmune/mflammatory, cell prolifistative, 
neurological, developmental, endocrine, eye, metabolic, and gastrointestinal disorders, including liver 
20 disorders described above. In <me aspect, an antibody wUchspedficaUy binds DME may be used 
directly as an antagonist or mdirectly as a tar^tfpg or delivery mechanism for faring^g a 
pharmaceutical agent to cells or tissues which express DME. 

In an additional embodiinent, a vector esqiiessing the conqdement of the polynw^ 
encoding DME may be administered to a sulqect to treat or prevoit a disoi^ 
25 increased expression or activity of DME including, but not linnted to, those described above. 

Inoiher embodiments, any of tbeprotdns, antag^mists, antibodies, agsxiists, conqilemesitary 
sequoKces, or vectoss of the invention may be adininislered in CQin^^ 

thoqientic agents. Selecticxi of the appropriate agsots for in combination therapy may be made 
byoneof ordinary skill in the art, according to ccmventionalph^ai^^ The 

30 combinaticm of tl^fi^utic a^ts may act synergisticaUy to effect the treatment or prevention of the 
various disorders described above. Usmg this qyproach, one may be able to achieve therapeutic 
efBcacy with lower dosages of each agent, thus reducing the potential for adverse side effects. 

An antagonist of DME rnay be produced using methods wUdi are g^seraUy kn^ 
Id particular, purified DME may be used to produce antibodies or to screen libraries of 

35 i^barmaceutical agq[its to identify those which specifically huid DME. Antibodies to DME may also 
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be g^emted using methods that are weU known in Such antibodies may include, but are not 

limited to, polyclonal, monoclonal, chimeric, and single chain antibodies. Fab fragments, and 
fragments produced by a Fab expression library. Neutralizing antibodies (Le., tihose winch inhibit 
dimer formation) are generally preferred for therapeutic use. 
5 For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans, 

and otibers may be inmnmized by injection with DME or m 

which has immunogenic properties. Depending on the host species, various adjuvants may be used to 

increase immunological response. Such adjuvants include, but are not limited to, Freund's, mineral 

gels such as aluminum hydroxide, and surface active substances such as ly soledithin, plurosuc 
10 polyols,polyanions,p^des, oil emulsions, KUI, and dinitrophenol. Among adjuvants used in 

humans, BCG (badlli Calmette-Guerin) and Corynebacteriumparvum ace especially preferable. 

It is preferred (hat the oligopeptides, peptides, or fragments used to induce antibodies to DME 

have an amino acid sequence consisting of at least about 5 amino acids, and generally will consist of 

at least about 10 amino adds. It is also prefenible that these oligopeptides, peptides, or fraginents are 
15 identicaltoaportionof the annuo acid sequ0«:e. of the natural proteinu Short stretches of DME 

ammo adds may be fused with those of another protein, such as KLH, and antibodies to the chimeric 

molecule may be produced. 

Monoclonal antibodies to DME may be prepared using any tsechnique which provides for the 

production ofantibody molecules by continuous ceU lines in culture. These include, but are not 
20 limited to, the hybridoma technique, the human B -cell hybridama technique, and tihe EB V-bybridoma 

technique. (See, e.g., Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. (1985) J. 

finmunol. Methods 81:31-42; Cbte, RJ. et al. (1983) Proc. Natl. Acad. Set USA 80:2Q26-203ft and 

Cole, S.P. et aL (1984) Md. C^Biol. 62:109-120.) 

In addition, techniques developed for the production of **clumBric antibodies,'* such as the 
25 splicingofnx)useantibo<fy genes to human antibodty genes.to oi^ain a molecule with appropriate 

antigm qiecifidty and biological activity, can be used. (See, e.g., Morrison, SX. et aL (1984) FToc. 

Nati. Acad. ScL USA 81:6851-€855; Nwibogpr, M.S. et aL (1984) Nature 312:604^; and Takeda. 

S. et al. (1985) Nature 314:4S2h454.) Attemativdy, techniques described for the poroduction of sin^ 

chain antibodies may be adgytPiT; using m^ods known in the ait, to produce DME^specific single 
30 chain antibodies. Antibodies with related speciBcity, but of distinct idiotypic coiiq)osition, may be 

generated by chain shufOing from random combinatorial immunoglobulin libraries. (See, e.g.. 

Burton, DJR. (1991) Proc. Nati. Acad. Sd. USA 88:10134-10137.) 

Antibodies may also be produced by inducing in vivo production in the lynq)hocyte 

population or by screening immunoglobulin libraries or panels of highly specific binding reag^ts as 
35 disclosed m tiie literature. (See, e.g., Qrlandi, R. et al. (1989) Proc. Nati. Acad. ScL USA 
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86:3833-3837; Winter, G. et aL (1991) Nature 349:293-299.) 

Antibody jEragments which contain specific binding sites for DME may also be geQ^:ated. 
FQrexanqple,si]chfiagmrat5]nchide>butaienotlinutedto,F^^^ produced by pq>s]n 

digestion of the antibody molecule and Fab ftagnents generated by leducing the disulfide bridges of 

5 the F(ab')2 firagments. Alternatively, Fab expression libraries may be constnicted to allow lapid and 
easy idrntificadon of monoclonal Fab fragments with the desired specificity. (See, e.g.. Hose, W.D. 
et al. (1989) Science 246:1275-1281.) 

• Various immunoassays may be used for scre^ung to identify antibodies having the desired 
specificity. Numerous protocols for competitive binding or immunoradiometric assays using dther 

10 polyclonal or monocloiial antibodies with established specrfidties are weUl^ Such 
immunoassays typically involve the measurement of complex formation between DME and its 
specific antibody. A two-site, monoclcmal-based immunoassay utilizing monoclcnial antibodies 
reactive to two non-interferuag DME epitcqies is generally used, but a competitive binding assay may 
also be errq>loyed (Pound, supra) . 

15 Various methods such as Scatchard analysis in ccmjunction witii radiohnmunoassay 

tecfaniquesmaybeusedtoassess the affinity of antibodies for DMR Affinity is expressed as an 
association ccmstant, which is defined as the molar concentration of DME-antibody complex 
divided by the molar concentrations of free antigen and free antibody under equilibrium conditions. 
The determined for a preparation of polyclonal antibodies, which are heterogeneous in thdr 

20 afBnities for nudtiple DME epitopes^ represents the average affinity, or avidity, of the antibodies for 
DME. The determined fiEsr a preparation of monoclonal antibodies, which are rnonos^^ 
particular DME epitope, rqKTesents a true measure of affinity. EB^HafBnity antibody preparations 
wilh raiigmg from alxmt 10^ to 10^ LAnole are prdfe^^ 

DME-antibody conq^lex must withstand rigorous manq)ulations. Low-nafiBnity antibody prq>arati(xis 
25 withK^rangjbogfromalxwtlO^tDlO^LADoleareprefenedforTO 

procedures which ultimately require dissociation of DME, preferably in active form, from tibe 
antibody (Catty, D. f iQM) Anrihniliftft, Vnlitmft T; A Pract ical Ap^macfa, IRL Press, Washington DC; 
liddell, J JB. and A. Cryer ( 199 1) A Ptectical Guide to Monn rJnnal Antibniltfta^ John Wiley & Sons, 
NewYoritNY). 

30 The titer and avidity of polyclonal antibody preparations may be further evaluated to 

determinethequatity arid smtabifity of such preparations for certain downstream ^pUcati For 
exarrqple, a polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, 
preferably 5-10 mg specific antibody/ml, is generally enq>]oyed in procedures requiring precipitation 
of DM&antibody coni^Iexes. Procedures for evaluating antibody specificity, titer, and avidity, and 

35 guidelines for antibody quaUty and usage in various applications, are generally available. (See, e.g.. 
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Catty, supra, and Cbligan et al. supra, ) 

Jn another embodiment of the mventioii, the polynucleotides encoding DME, or any fragment 
or complernentth^eoi^ may be used for therapeutic purposes, bone aspect, modifications of gene 
expression can be achieved by designing complementary sequences or antisense molecules Q^NA, 

5 RNA, PNA, or modified olig(mucleotides) to the coding or regulatory regions of the ^ne encoding 
DME. Such technology is well known in the art, and antisense oligonucleotides or larger fragments 
can be designed from various locations along the coding or control regions of sequences encoding 
DMR (See, e.g., AgiawaL S., ed. (1996) Antisense Therapeutics. Humana Press Inc., Totawa NF.) 
In thraapeutic use, any g^e delivery system suitable for introduction of the antisense 

10 sequences into appropriate target cells can be used Antisense sequences can be delivea^ed 

intracellularly in the form of an expression plasmid which, vapaa transcription, produces a sequence 
conqplmientaiy to at least a portion of the ceUu]ar sequence «icoding the target piot^ (See,e.g., 
Slater, JJB. et aL (1998) J. Allergy Clin. ImmunoL 102(3):469-475; and Scanlon, KJ. et aL (1995) 
9(13):1288-1296.) Antisense sequences can also be introduced intracellularly through the use of viral 

15 vectors, such as retrovirus and adeno-associated virus vectors. (See, e.g.. Miller, A J>. (1990) Blood 
76:271; Ausubel, supra: Uckert, W. and W. Wahher (1994) PharmacoL Ther. 63(3):323-347.) Other 
gene delivery mechanisms include liposome-derived systems, artificial viral envelopes, and other 
systems known in flie art (See, e.g„ Rossi, JJ. (1995) Br. Med. Bull. 51(l):217-225; Boado, R.J. et 
al. (1998) J. Pharm. ScL 87(11>.1308-1315; and Morris, M.C et aL (1997) Nucledc Adds Res. 

20 25(14):2730-2736.) 

In' another enibodiment of the invention, polynucleotides ^coding DME may be used for 
somatic or gemdine gene Aerapy. G«[ie tfaerq>y rxiay be perfomied to (i) correct a g^etic deficiency 
(e.g., in the cases of severe combined immunodeficiency (SCID>-X1 disease characteriTsed by X- 
linked inheritance (Cavasana-<!^o, M. et aL ^000) Science 288:669-672), severe combined 

25 immunodeficieucy syndrome associated witib an inherited adenosine deaminase (ADA) deficiency 
CBlaese, RM et al. (1995) Science 270:475-480; Bor^gnoa), C et aL (1995) Sdcaice 270:470475), 
cystic fibrosis ^abner, J. et aL (1993) Cefl 75:207-216; Crystal, R.G. et aL (1995) Horn. Geaie 
Theanapy 6:643^666; Crystal, R.G. et aL (1995) Hura Gene Tbetapy 6:667-703X thalassamias &milial 
hypercholesterolemia, and hemoplulia resulting frcxn Factor Vm or Factor IX defid^cies (Qystal, 

30 R.G. (1995) Science 270:404410; Venna, LM. and N. Sonria (1997) Nature 389:239-242)), (ii) 

express a conditionally lethal gene product (e.g., in the case of cancers which result firom unregulated 
ceU proliferation), or (iii) e?qness a protein which affords protection against mtracellular parasites 
(e.g., against human retroviruses, such as human immunodeficiency virus (HIV) (Baltimore, D. 
(1988) Nature 335:395-396; Poeschla, E. et al. (1996) Proc. Natl. Acad. Sd. USA 93:11395-11399), 

35 hepatitis B or C virus (EIB V, HCV); fungal parasites, such as Candida albicans and Paracoccidioides 
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btasiliensis: and protozoan parasites such as Plasmodium falciparum and Trvpanosama cnm) , lathe 
case where a gonetic deficiency in DME egression or regulation causes disease, the expressim of 
DME from an dpptopiste pqpulatioii of transduced cells may alleviate the clinical manifestations 
caused by the genetic defid^icy. ' 
5 hi a further embodiment of the invention, diseases or disorders caused by deficiencies in 

DME are treated by constructing matnTnallan e^qpzession vectors encoding DME and introducing 
these vectors by mechanical means into DMErdeficient cells. Mechanical transfer technologies for 
use with cells in vivo or ex vitro include (i) direct DNA microinj ection into individual cells, (ii) 
ballistic gold particle delivery, (ui) ^posome-mediated transfection, (iv) receptor-mediated gene 

10 transfer, and (v) the use of DNA transposcms (Morgan, ILAu and W.F. Anderson (1993) Annu. Rev. 
Biochem- 62: 191-217; Ivies, Z. (1997) Cell 91:501-510; Boulay, J-L. and H Rfidpon (1998) Cnrr. 
Opht BiotechnoL 9:445-450). 

ETqxression vectors that may be effective for the expression of DME include, but are not 
limited to, the PC3)NA 3.1, EPITAG, PRCXMV2. PREP, PVAX, PCR2-TOPOTA vectors 

15 (6ivitK)gen, Carlsbad CA), PCMV-SCRIPT, PCMV-TAG, PEGSHWTBRV (Stratagene, La JoUa CA), 
andPTET-OI5?,FrET-ON,PraE2,FIlUa2-LUC,FIX-^ DMEmay 
be expressed using (i) a constitutively active promoter, (e.g., fccm cytomegalovirus (CMV), Rous 
sarcoma vurus (RSV), S V40 virus, tfaynmidme kinase (TK)* or p-actm genes), (ii) an inducible 
promoter (e.g., the tetrai^cline-r^^ulated promoter ((jossen, M. and H. Bujard (1992) Ftoc. Natl. 

20 Acad. Sd. USA 89:5547-5551; Gossen, M. et al. (1995) Science 268:1766-1769; Rossi, F.M.V. and 
ELM. Blau (1998) Qmr. Opin. BiotechnoL 9:451-456), commercially available in the T-REX plasmid 
Cbvitiog^)); the ecdysone-indudble promoter (available in the plasimds WGSXK and PIND; 
hivitrogen); the FK506/rqpaniycm inducible promoter, or the RU486Ai]ifqxristone inducible promoter 
(RossL FJM V. and HM, Blau. supra) ), or (iii) a tissue-specific ptpmotar or the native promotCT of the 

25 endogoious gene encoding DME fiom a normal individual 

Commeicially available liposome transformation kits (e.g., the PERFECT LIPID 
TRANSFBCnON EXT, available from bvitrogen) allow one with ordinary skill in die art to ddiver 
pdynudeotides to target celk in culture aiid require rii^^ 

parameters. In the altemative, tr ansfou n a tion is performed using the calcium phosphate method 
30 (Graham, FX. and AJ. Eb (1973) Virology 52:456-467), or by electroporation (Neumann, E. et al. 

(1982) EMBO J. I 'Ml-MS). The introduction of DNA to primary cells requires modification of 

these standardized mammalian transfecticm protocols. 

In another embodiment of the invention, diseases or disorders caused by genedc defects with 

respect to DME e;q)ression are treated by constructing a retrovirus vector consisdng of (i) the 
35 polynucleotide caicoding DME und^ the control of an indqiendent pronKiter or the retrovirus long 
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tenninal repeat (LTR) promoter, (ii) appropriate RNA packaging signals, and (iii) a Rev-iespoosive 
element (RKE) along witfi additional retrovirus cis-^ctiag RNA sequences and coding seqoraces 
lequired for efficient vector propagation. R^rovirus vectors (e.g., PFB and FFBNEO) are 
conuneccially available (Stratagene) and are based on published data (Riviere, I. et al. (1995) Pcoc. 
5 Nad. Acad. Sci. USA 92:6733-6737), incorporated by referraice herein. The vector is propagated in 
an appropriate vector producing cell line (VPCL) that expresses an mvelope gpne with a tropismfor 
receptors on the target cells or a promiscuous envelope protein such as VSVg (Ainmtano, D. et al. 
(1987) J. ViroL 61:1647-1650; Bender. M.A. et aL (1987) J. Virol. 61:1639-1646; Adam. M.A. and 
AJ). Miller (1988) J. Virol. 62:3802^3806; Dull, T. et al. (1998) J. Virol. 72:8463-8471; Zufferey. R. 

10 et aL (1998) J. ViroL 72:9873-9880). U.S. Patent No. 5,910,434 to Rigg CMefliod for obtaining 
retrovirus packaging cell lines producing higjb transducing efiBciency retroviral supecnatantf *) 
discloses a method for oMaining retrovirus packaging cell lines and is hereby incorporated by 
reference. Propagation of retrovirus vectors, transduction of a population of cells (e.g., CD4^ T- 
cells), and the return of transduced cells to a patient are procedures well known to persons skilled in 

15 the art of gene therapy and have heea well documented (Ranga, U. et al. (1997) J. ViroL 71:7020- 
7029; Bauer, G. et al. (1997) Blood 89:2259-2267; Bonyhadi, Mi. (1997) J. Virol. 71:4707-4716; 
Ranga, U. et aL (1998) Proc. Natl. Acad. Sd. USA 95:1201-1206; Su, L. (1997) Blood 89:2283- 
2290). 

In the altffliative. an adenovurus-based g^e therapy delivery system is used to deliver 

20 polynucleotides encoding DME to cells which have one cmt more g^etic abnormalities with respect to 
' the expression of DMR The coxistniction and packaging of adenovirus-based vectors are well known 
to those with cndinary skill in the art Replication defective adenovirus vectors have proven to be 
versatile for in^Kirting genes ^coding immmK»egulatory proteins into intact islets in the pancreas 
(Csete, M£. et al. (1995) Tran^lantation 27:263-268). Pot^ally useful adraoviral vectors are 

25 described in U.S. Patent No. 5,7(y7,618 to Armentano ("Admovxrus vectors for gene thaapy*'), 
hanebymcoiporated by reference. For adenoviral vectors, see also Antinozzi, PA. et al. (1999) 
Amrn. Rev. Nutr. 19:511-544 and Vcsma. LM. and N. Somia (1997) Nature 18:389:239-242, both 
inccarporated by ref exraice ha:ein. 

In anottier alternative, a herpes-based, goie therapy delivery system is used to deliver 

30 polynucleotides encoding DME to target cells which have one or more genetic abnonnalities with 
respect to the expression of DME. The use of herpes sinqplex virus (HSV>based vectors may be 
especially valuable for introducing DME to cells of the central nervous system, for which HSV has a 
tropism. Theconstnictionandpackagingofherpes4)ased vectors are weUlmown to those with 
ordinary skin in the art A r^UcationH»n5)etem herpes siii5)lex virus (HSV) type 1-based vector has 

35 been used to deliver a reporter gene to the eyes of primates (Liu, X. et al. (1999) Bxp, Bye Res. 
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169:385-395). The oonstnictiQii of a HSV-1 vims vector has also been disclosed m detail in U.S. 
Intent No. 5,804,413 to DeLuca Clfetpes sinqilex virus strains for g^e transfef*)* which is hereby 
inco3fporated by lefeience. U.S. Patrait No. 5,804,413 teaches die use of lecoinbinant HSV d92 which 
consists of a genome containing at least one exc^enoos gene to be tiansf ened to a cell under die 
5 control of the appropriate promoter for purposes including hum^ Also taught by this 

patent are the construction and use of recombinant HSV strains deleted for ICP4, ICP27 and ICP22. 
For HSV vectors, see also Goins, WJ. et al. (1999) J. ViroL 73:519-532 and Xu, R et al. (1994) 
Dev. BioL 163: 152^161, hereby incorporated by reference. The noanipulation of cloned herpesvirus 
sequences, the geneFaticm of recombinant virus f oUowmg the transfecd<m of multiple plasmids 

10 containing different segments of the large hoqpesvirus genomes, the growth bsA propagation of 
herpesvirus, and the infisction of cells with h^pesvirus are techniques well known to those of 
ordinary skill in the art 

In another alternative, an a^havirus (positive, siiigle-stranded RNA virus) vector is used to 
deliver polynucleotides encoding DMB to target cells. The biology of the prototypic alphavirus, 

15 Semliki Forest Virus (SFV), has been studied extensively and gene transfer vectors have been based 
on die SFV gpnome (GarofF, H. and EL-J. Li (1998) Cuir.. Opin. BiotechnoL 9:464469). During 
alphavirus RNA replication, a suVgwomic RNA is generated that normally encodes the viral capsid 
proteins. This subgenonnc RNA rqihcates to higher levels than the fiin length geno^ 
resulting in the overproduction of cscpsid proteins relative to the viral protdbis with enzymatic activity 

20 (e.g., protease and polymerase). Siinilarly,insertmg die coding sequence for DME into the alphavin^ 
gencnne in place of the capsid-coding region results in the production of a large number of DMB- 
coding RNAs and the synthesis of high levek of DME in vectcv transduced cells. While alphavirus 
mf ecticm is typically associated widi cell lysis within a few days, die ability to establish a persistent 
mf ection in hamster nonnal kidney cells CBHK-2 1) with a variant of Sindbis virus (SIN) indicates that 

25 the lytic replication of a^haviruses can be altered to suit the needs of the g^ therapy application 
(Dryga,S^etal. (1997) A^logy 228:74-83). Thewidehostrangeofalphaviruses will allow the 
. introducticm of DME into a variety of cell types. Tbespecifictransductioiiof asubsetof ceOsina 
pqpulaticHi may require the sorting of ceDs jaior to transducticHL The methods of nianq)ulating 
infectious cDNA clones of alphaviruses, performing a^havirus cDNA and RNA transfections, and 

30 perfonning alphavirus infections, are well known to those with ordinary skill in the art 

Oligonucleotides derived from the transcription initiation site, e.g., betwem about positions 
-10 and +10 from the start site, may also be eniployed to inhibit gene e^qnression. Similarly, 
inhibition can be achieved using triple helix base-pairing methodology. Triple helix pairing is useful 
because it causes inhibition of the abilit/ of the double helix to open sufficiendy for the binding of 

35 polymmases, transcription factors, or regulatory molecules. Recent therapeutic advances using 
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triplex ONA have been described in the literature. (See, e.g.. Gee, et aL (1994) in Hnber, 
and B.L Carr, Molecular and Tmim mologic Approaches, Futura Publishing, Mt Kisco NY, pp. 163- 
177.) A complementaiy sequence or antisense molecule may also be designed to block translation of 
mRNA by pieventing the transcript from binding to ribosomes. 

5 Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleava^ of 

KNA. The mechanism of ribo^me action involves sequence-specific hybridization of the libozyme 
molecule to complementaiy targ^ RNA, followed by endonucleolytic cleavage. For example, 
engineered hammerhead motif ribozyme molecules may specifically and efBdently catalyze 
endonucleolytic cleavage of sequences encoding DME. 

10 Specific ribozyme cleavage sites within any potratial RNA target are initially identified by 

scanning tfie target molecule for ribozyme cleavage sites, mcluding the following sequences: GUA, 
GUU, and GUC. Once identified, short RNA sequmces of between IS and 20 ribonucleotides, 
correspcmdmg to the region of the target gene containing the cleavage site, may be evaluated for 
secondary structural features which may lendtf the oligonucleotide inoperable. The suitability of 

15 candidate targets may also be evaluated by testing accessibility to hybridization with complementary 
oligonucleotides using ribonuclease protection assays. 

Complemmtaiy ribonucleic add molecules and ribozymes of the invention may be prepared 
by any inetiiodlmown in the art for the synthesis of nucleic add inolecules. These include techniques 
for diemically synthesizing oHgonucleotides such as solid jrfiase phosfdioramidite chemical synthesis. 

20 Alternatively, RNA molecules may be gloated by in vitro and in vivo transcription of DNA 

sequences encoding DME. Such DNA seq[uences may be incorporated into a wide variety of vectors 
witii suitable RNA polyrnerase promoters such as T7 or SP6. Alternatively, these cDNA constructs 
that synthesize conqpiementary RNA, constitutively or indncibiy , can be introduced into cell lines, 
cells, or tissues. 

25 RNA molecules may be modified to iiicreaseintracennlarstalnUty and 1^ Possible 

modificaticm include, but are not limited to, the addition of flanld^ 

ends of tiie molecule, or the use of phosjj^iofodiioate or 2' Onm^y 1 laflier tiian phosphodiestBrase 
linkages widnn the backbcne of die molecule. This concqit is inheieot in the production of PNAs 
and can be extended in aU of these inolecules by the incfaisioa of nontiaditional b^ 
30 queosine, and wybutosine, as well as acetyl-, methyl-, tino-, and similarly modified forms of adeiune, 
cytidine, guanine, thymine, and uridine which are not as easily recognized by endogenous 
endonucleases. 

An additional embodiment of the invention enconqiasses a method for screening for a 
conqK>und wUch is effictive in alteririgexpressicm of a polynucleotide encod^ Compounds 
35 which may be effective in altering e^tpression of a specific polynucleotide may include, but are not 
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limited to, oligonucleotides, antisense oligonucleotides, ti^>le helix-fanning oligonucleotides, 
tianscnption factors and other polypeptide transcdptional legolatois, and nan-macromolecular 
chemical entities which are cjq>able of interacting with spedfic polynucleotide sequences. Effective 
conqxnmds may alter polynucleotide expiession by acting as either inhibitors or promoters of 

5 polynucleotide egression. Thus, in the treatment of disorders assodated with increased DME 
expression or activity, a compound which specifically inhiUts esqpression of the polynucleotide 
encoding DME may be therapeutically usefid, and m the treatment of disorders associated with 
decreased DME eqiiessicHi or activity, a compound which specifically promotes e}ipression of the 
* polynucleotide encoding DME may be therapeutically useful. 

10 At least one, and up to a plurality, of test compounds may be screened for effectiveness in 

alteiing esqxression of a specific polynucleotide. A test coiiq)ound may be obtained by any method 
commoQly known in the art, including chemical modification of a compound known to be effective in 
altrang polynucleotide e^qnession; selection from an existing, commercially-available or proprietary 
library of naturally-occurring or non-natural chemical compounds; rational design of a confound 

15 based on chemical and/or structural properties of the target polynucleotide; and selection from a 
fibpry of chemical compounds created combinatorially or randm^ A sample comprising a 
polynucleotide encoding DME is exposed to at least one test conq>oundtbm Thesan^le 
may ccsspise, for example, an intact or permeafailized cell, or an in vitro cell-fcee or reconstituted 
Uiochemical Systran. Alterations in the esqpresaon of a polynucleotide encodii^ 

20 any method commonly known in the art Typically, the expressicm of a specific nucleotide is 

detected by hybridization with a probe having a nucleotide sequmce complementary to the sequence 
ofthe polynucleotide encoding DME. The amount of hybddizaticm may be quantified, tlnisfiramaing 
die basis for a coiiq[>arison of tfie expiression of the polynucl^^ 

one or more test conqpounds. Delectionof achaiigeinthe^qxressionof apofynucleotideexposedto 
25 a test ccmqxiund indicates ttiat the test ccxx^oond is eflfecti^^ 

po^ucleodde. A screra for a compound elective in altering expression of a qied 

ran K> rarnP^ ftiit, fnr ATampI^, iigmyr a ^Irlii'MVMrrhi mmiVCeS tXHnbe gePC CXprCSSion SVStcm 

(Atidns, D. et aL (1999) U.S. Patent No. 5,932,435; Amdt, GM. et aL (2000) Nucleic Adds Res. 
28£15) or a human cell line such as HeLa cell (CSaike, MX. et al. (2000) Biochrat Biophys. Res. 
30 Commun. 268:8-13). A particular embodunent of die preset inv^on involves soeoun^ 

combinatorial library of oligonucleotides (such as deoxyribonucleotides, ribonucleotides, peptide 
nucleic acids, and modified oligcmucleotides) for antisense activity against a spedfic polynudeotide 
sequence (Bruice, T.W. et al. (1997) U.S. Patent No. 5,686,242; Bruice, T.W. et aL (2000) U.S. 
Patent No. 6,022,691). 

35 Many methods for introducing vectors into cells or tissues are available and equally suitable 
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for use in viva in vitro, and ex viva For ex vivo therapy, vectors mav be introduced into stem cells 
taloen bam the patient and clonally propagated for autologous transplant back into that same patient 
Delivery by tiansfection, by liposome injections, or by polycationic amino polymers may be adiieved 
using methods whidi are well known in the ait (See, e.g., Goldman, C.K. et al. (1997) Nat 
5 Biotechnol. 15:462-466.) 

Any of the therapeutic methods described above may be 2q>plied to any subject in need of 
such therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and 
mookeys. 

An additional embodiment of the invrnticm relates to the administration of a conqx)sitiQn 

10 which gw^ally comprises an active ingredient formulated widi a phannaceutically acceptable 
exdpient Excipients may include, for exainple, sugars, starches, celluloses, gums, and proteins. 
Various formulations are commonly known and are thoroughly discussed in the latest edition of 
Remington's Pharmflrp jitical Sciences (Maack Publishing, Baston PA). Such compositions may 
consist of DME, antibodies to DME, and mimetics, agooists, antagonists, or inhibitors of DME. 

15 The conqx>sitions utilized in this invention may be adnunistered by any number of routes 

including, but not limited to, oral, intravenous, iotramuscular, intxa-arterial, intramedullary, 
intrathecal, intraventricular, pulmonary, transdermal, subcutaneous, intraperitcmeal, mtranasal, 
literal, tc^ical, sublingual, or rectal means. 

Ccmipodtions for pulmonary adnunistration may be prepared in liquid oar dry powder form. 

20 These CQnqx>sitions are generally aerosolized immediately pdor to Inthe 
case of small molecules (e.g. traditional low molecular wei^t organic drugs), aerosol delivery of 
&st-actingfoEmulatioDS is well-known in the art In the case of macromolecules(e.g. larger peptides 
and proteins), recCTt developments in the field of puhnonary delivery via tiie alveolar region of the 
hsog have enabled the practical deliveiy of drugs sudi as insulin to blood circulation (see, e.g. , Patton, 

25 J.S.etaI.,U.S.P&t^ No. 5,997,848). Puhnonary delivery has the advantage of adinmistration 
without needle injection, and obviates die need for potentially toxic penetration enhancers. 

CcxDpositicflds suitable for use in the invention include co!nqx)sitions i^ierein die active 
ingrediCTts are contained in an efifective amount to achieve the intended purpose. The detenmnatioQ 
of an effective dose is well within ttie capability of those skilled in the art 

30 Specialized forms of compositions may be prepared for direct intracellular delivery of 

macromolecules conqnising DME or firagments th^eof . For exaiiq>le, liposome preparations 
containing a cell-inq>ermeable macromolecule may prraiote cell fusion and intracellular delivery of 
the macromolecule. Alternatively, DME or a fragment th^eof ruay be joined to a short cationic N- 
' terminal portion from the HIV Tat-lproteiiL Fusion proteins thus genmted have been found to 

35 transduce into die cells of all tissues, including the brain, in a mouse unodel system (Schwarze, S JL et 
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al. (1999) Science 285:1569-1572). 

For any con^und, die thenq[>eutically effective dose can be estimated initially either in cell 
culture assays, e.g., of neqplastic ceUs, or in animal models such as mice, rats, rabbits, dogs, 
monkeys, or pigs. An animal model may also be used to detennnietlieqypropriat^ 

5 range and route of administration. Such information can then be used to detennine useful doses and 
routes for administratioD m hmnans. 

A tfaerapeutically effective dose refers to that amount of active ingredient, for exanq)le DMB 
or firagments thereof, antibodies of DMB, and agcmists, antagonists or mhibitcnrs of DMB, which 
ameliorates tiie syiiq>toms or condition. Thenq>euticefQcacy and toxicity rnay be determined by 

10 standard pharmaceutical procedures in cell cultures or with experimental animals, such as by 

calculating the ED50 (the dose fhei^utically effective in 50% of the population) or LD5D (the dose 
lethal to 50% of the population) statistics. The dose ratio of toxic to ther^utic effects is the 
therapeutic index, which can be expressed as the UDs/EDsa ratio. Cocopositions which exhibit large 
therapeutic indices are preferred. The data obtained from cell culture assays and animal studies are 

15 used to formulate a range of dosage for human use. The dosage contained in such con^ositions is 
preferably witiun a range of circulating concratrations that includes the ED5D with littie or no toxicity. 
The dosage varies within tiiis range depending upcm the dosage form enq;>loyed, the sensitivity of the 
patient, and the route of administration. 

The exact dosage will be detenmned by tiie practiti(»Qer, in light of factors related to the 

20 subject requiring treatment Dosage and admimstration are adjusted to provide sufficient levels of the 
active moiety or to maintain the deshed effect Factors which may be taken into account include the 
severity of the disease state, the general health of the subject the a^ weight, and gender of the 
sulgect, time and frequency of administration, drug cQml3inatio8i(s), reacticm sensitivities, and 
response to thei^yy. Lcmg-actipg cxmipositioiis may be adnnmstcared every 3 to 4 dajfs, every wedc, 

25 or biweddy dqiending on the half-fife and clearance rate of tiie particular formulation. 

Normal dosage ^mmmts may vary from about 0.1 A<g to 100,000 A<g> ^ to a total dose of 
sd)ont 1 grain, dq[)endiDgupcmtiie route of adnnnislc^ Guidance as to particular dosages and 
Tnfttho^ijy of ddiveiy is provided in the fitexatme and generally available to practitionera in die art 
Those skilled in die art will employ differait formulations for nucleotides than for protdns or thdr 

30 inhibitors. S inwlar lyj HftHvery of polynucleotides or polyp^tides will be specijBc to particular cells, 
conditions, locations, etc. 
DIAGNOSTICS 

hi ancHher »nbodimc»it, antibodies y^Mch specifically bind DME may be used for die 
Hiagno<i?iig of disocd^ characterized by e?qn:ession of DME, or in assays to monitor patients being 
35 treated with DMB or agonists, antagonists, or inhibitors of DME. Antibodies useful for diagnostic 
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purposes nmy be prepared in the same maimer as descri^ Diagnostic assays 

for DME include mediods which utilize the antibody and a label to detect DME in human body fluids 
or in extracts of cells or tissues. Tlieantibodiesmay be used with or without inodification, and may 
be labeled by covalent or non-covalent attachment of a reports molecule. A wide variety of reporter 
S molecules, several of which are described above» are known in the art and may be used. 

A variety of protocols fc»: measuring DME, mcluding EUSAs, SIAs, and FACS, are known 
in die art and provide a basis for diagnosing altered or abnormal levels of DME e^qsression. Normal 
or standard values for DME expres sion are established by condnning body fluids cac cell extracts 
taken fiom normal roammalian subjects, for exanqple, human subjects, with antibodies to DME und^ 

10 conditions suitable for conq>lex formation. The amount of standard complex formation may be 

quantitated by various methods, such as {diotometric means. Quantities of DME e^nessed in subject, 
controU and disease samples fcom biopsied tissues are compared with Deviation 
betwe^ standard and subject values establishes the parameters for diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding DME may be used for 

15 diagnostic purposes. The polynucleotides which may be used include oligonucleotide sequ^ces, 
complemmtary RNA and DNA molecules, and FHAs. The polynucleotides may be used to detect 
and quantify gene expression in biq[>sied tissues in which expression of DME may be correlated with 
disease. Thediagnosticassay may be used to deternime absence, presence, and excess egression of 
DME, and to mrautor regulation of DME levels during therapeutic intervmtion. 

20 hi one aspect, hybridization with PGR probes which are C2q>able of detecting polynucleotide 

sequences, including genomic sequences, ^coding DME or closely related molecules may be used to 
identify nucleic acid sequences which encode DME. Thespecifidtyof the probe, whether it is made 
from a highly qpedficr^on, e.g., the S'n^^ulatoiy r^cn, or from a less qpedficn^ion, e.g., a 
conserved rnc^ and the string^y of the hybndization or anq>Mcati 

25 probe idrntifies only naturally occurring sequences encoding DME, aUelic variants, or related 
sequences. 

Probes may also be used for tibe detection of related sequences, and may have at least 50% 
sequence identify to any of the DME encoding sequences. The hybridization probes of the subject 
invention inay be DNA or RNA and may be derived from the sequence of 
30 genoimc sequences including promoters, enhancers, and introns of the DME gene. 

Means for producing specific hy taidization probes for DNAs encoding DME include the 
cloning of polynucleotide sequences encoding DME or DME derivatives into vectors for the 
producdon of mRNA probes. Such vectors are known in the art, are commercially available, and may 
be used to synthesize RNA probes in vitro by means of the addition of the appr(^niate RNA 
35 polymerases and the appr o pr iate labeled nucleotides. Hyhridizationprobesmay belabeledby a 
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variety of rqxnter groiqps, for exan^le^ by radionuclides such as ^ or ^S, or by enzymatic labels, 
such as alkaline phosphatase coupled to the probe via avidin/tnotin coupling systems, and the like. 

Polynucleotide sequences encoding DME may be used for the diagnosis of disorders 
associated with e^qpression of DME. ExanQ>le5 of such disorders include, but are not limited to, an 
5 autoimnume/inflammatory disorder, such as acquired inmmnodefidency syndrome (AIDS), 

Addison's disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis, anQrloidosis, 
anemia, asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune 
polyradocrinopathy-candldiasis-^sctodCTiial dystrophy (AFECED), bronchitis, cholecystitis, contact 
dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema, 

10 episodic lyiophopenia with lynqphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic 
gastritis, glomerulonephritis, Goodpasture's syndrome, gout. Graves* disease, Hashimoto's 
thyroiditis, hypereosincplulia, irritable bowel syndrome, multq^le sclerosis, myasthenia gravis, 
myocardial or pericardial inftammaticm, osteoarthritis, osteoporosis, pancreatitis, polymyositis, 
psoriasis, Rdter's syndrome, riieumatoid arthritis, scleroderma, Sjdgren*s syndrome, systemic 

IS anapfaylaris, systemic hxpus erythematosus, systemic sclerosis, thrornbocytopenic purpura, ulcerative 
colitis, uveitis, Werner syndrome, complications of cancer, hemodialysis, and extracorporeal 
circulation, viral, bacterial, fin^, parasitic, protozoal, and helminthic infections, and trauma; a cell 
prolif»ative disorder, sudi as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cinfaosis, 
hqyatitis, nmxed connective tissue disease (MCTD), myelofil»osis, paroxysmal nocturnal 

20 hemoglobinuria, polycythemia vera, psoriasis, primary thromboc^themia, and cancers including 
adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocardnoma, and, in 
particular, cancers of the adrenal gland, bladder, bone, bcxie marrow, brain, breast, cervix, gidl 
bladder, gai^glia, gastrointestinal tract, heart, kidney, liver, hn^ muscle, ovary, pancreas, paraAyroid, 
penis, prostate, salivary glands, skin, spleen, testis, thymus, tfiyroid, and uterus; a neurological . 

25 disorder such as qpilepsy , ischemic cexebstovascular disease, stroke, cerebral neoplasms, Alzheimer's 
disease. Pick's disease, Ebmtinglon's disease, dementia, Piukinsoi&'s disease and oAer extrapyramidal 
disGiders, amyotrppfaic lataal sdearods and other rnotorneoron disoidears, progressive naual 
imiscnlar atrofAy , rhinitis {ugmeotosa, beareditary ataxias, mnltj^le sderosis and dfaer dranyelinating 
diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural abscess, 

30 siqq[)urative intracranial thrombophlebitis, myelitis and radiculitis, viral central nmous system 
disease, prion diseases including kuru, Greutzfeldt-Jakob disease, and Gerstmann- 
Straussler-Scheinker syndrome, fatal fiRr"ilial insomnia, nutritional and metabolic diseases of the 
nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, 
encephalotrigeminal syndrome, mental retardation and other developmoatal disorders of the central 

35 norvous system including Down syndrome, cerebral palsy, neuroskeletal disorders, autonomic 
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n^ous system disoideis, cranial nerve disorders, spinal cord diseases, muscular dystrophy and other 
neuromuscular disorders, p^ph^al nervous ^stem disorders, dennatomyositis and polymyositis, 
mherited, metabolic, endocrine, and toxic myopathies, myasdnraiia gravis, p^odic paralysis, mental 
disoQcd^ including mood, anxiety, and schizophrenic disordm, seasonal affective disorder (SAD), 

S akathesia, amnesia, catatonia, diabetic neuropathy, tardive dyskmesia, dystonias, paranoid psychoses, 
postherpetic neuralgia, Tourette's disorder, pt)gressive supranuclear palsy, corticobasal 
degeneration, and familial firontotempoial dementia; a developmental disonler, such as renal tubular 
acidosis, anemia. Gushing' s syndrome, achondrqplastic dwarfism, Duchenne and Becker muscular 
dystrophy, epilepsy, gonadal dysgenesis, WAGR syndrome CWilms* tumor, aniridia, g^tourinary 

10 abnormalities, and mental retardation), Smitti-Magenis syndrome, myelodysplastic syndrome, 
hereditary nmcoepithelial dysplasia, hereditary keratodermas, hereditary neuropathies such as 
Qsarcot-Marie-Tooth disease and neurofibromatosis, hypothyroidism, hydrocephalus, seizure 
disorders such as Syndenham's chorea and cerebral palsy, spina bifida, anencephaly, 
craniorachischisis, congenital glaucoma, cataract, and sensorineural hearing loss; an endocrine 

15 disorder, such as disorders of the hypothalamus and pituitary resulting fiom lesions such as primary 
brain tumors, adenomas, infarction associated with pregnancy, hypqphysectoiny, aneurysms, vascular 
malformations, thronibosis, infections, inmnmological disorders, and complications due to head 
trauma; disorders associated with hypopituitarism including hypogonadism, Shedian syndrome, 
diabetes in^pidus, Kallman's disease, Band-SchuUer-airistian disease, Letterer-^iwe disease, 

20 sarcoidosis, empty sella syndrome, and dwarfism; disorders associated witfi hyperpituitarism 
including acromegaly, gia^iti^, and syndrome of inappropriate antidiuretic hormone (ADH) 
secretion (SIADH) often caused by benign adraoma; disorders associated with hypothyrmdism 
iricfaid&Dig gqiter,rnyxederna, acute thyroiditis associated with bactm . 
associated with viral infection, autoimmune thyroiditis (Hashimoto^s disease), and oietinism; 

25 disorders associated with hyperthyroidism including thyrotoxicosis and its various forms. Grave's 
disease, pretibial n^xedema, toxic multinodular goiter, thyroid cardncxna, and Rummer's disease; 
disQidm associated with hyperparatiiyroidism including Conn disease (dironic hypercalemia); 
pancreatic disorders such as Type I or Type n diab^es inellitDS and assoda 
disorders associated with the adrenals such as hyperplasia, carcinoma, or adenoma of the adrraial 

30 cortex, hypertension associated with alkalosis, amyloidosis, hypokalemia, Cushing's disease, liddle's 
syndrome, and Amold-Healy-Gordon syndrome, pheochromocytoma tumors, and Addison's disease; 
disorders associated witii gonadal steroid hormcmes such as: in women, abnormal prolactin 
production, infertility, endometriosis, perturbations of the menstrual cycle, polycystic ovarian disease, 
hyperprolactinemia, isolated gonadotropin deficiency, amenorrhea, galactorriiea, hermaphroditism, 

35 hirsutism and virilization, breast cancer, and, inpost-menc^ausal wom^ osteoporosis; and, in men. 
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Leydig cell de&dency, male climacteric phase, and getminal cell aplasia, hypeigoiiadal disorders 
associated with Leydig cell tumors, androgen resistance associated with absence of androgm 
receptors, syndrome of S Orreductase, and gynecomastia; an eye disorder, such as conjunctivitis, 
teratoconjunctivitis sicca, keratitis, qnsclmtis, iritis, posterior uveitis, glaucoma, amaurosis fugax, 

5 ischemic optic neuropathy, optic neuritis, Leber's hereditary optic neuropathy, toxic optic 

neuropathy, vitreous detachment, retinal detachment, cataract, macular d^eneraticm, crastral serous 
chorioietinqpathy, retinitis pigmentosa, melanoma of the choroid, retrobulbar tumor, and chiasmal 
tumor; a metabolic disorder, such as Addison's disease, c^ebrotendinous xanthomatosis, congenital 
adrenal hyperplasia, coumarin resistance, cystic fibrosis, diabetes, fatty hepatocinbosis, 

10 fiiictose-l,6-diphosphatase defidency, galactosemia, goiter, glucagonoma, glycogen storage diseases, 
hereditary finictose intolerance, hyp^adrenalism, hypoadrenalism, hyperparathyroidism, 
hypoparathyroidism, hypercholesterolemia, hyperthyroidism, hypoglycemia, hypothyroidism, 
hyperlipidemia, hyperlipemia, lipid inyopathies, lipodystrophies, lysosomal storage diseases, Menkes 
syndrome, occipital horn syndrome, mannosidosis, neuraminidase deficiracy, obesity, pentosuria 

15 phenylketonuria, pseudovitamin I><]eficiency rickets; hypocalcemia, hypophosphatemia, 

postpubescenit cerebellar ataxia, and tyrosinemia, and a gastrointestinal disorder, such as dysphagia, 
peptic esoi^iagitis, esophageal spasm, esophageal stricture, esophageal carcinoma, dyspepsia, 
iudigestion, gastritis, gastric carcinoma, anorexia, nausea, emesis, gastroparesis, antral cac pyloric 
edema, abdominal angina, pyrosis, gastroenteritis, intestinal obstruction, infections of the intestinal 

20 tract, peptic ulcer, cholelithiasis, cholecystitis, cholestasis, pancreatitis, pancreatic carcinoma, biliary 
tract disease, hepatitis, hyperbilirubinemia, hereditary hyperbUirubinemia, cirrhosis, passive 
congestion of the liver, hepatoma, infectious colitis, ulcerative colitis, ulcerative proctitis, Crohn's 
disease, Whqiple's disease, Mallosy-Weiss syndrome, colonic carcinoma, colonic obstruction, 
irritable bowel syndrome, short bowel syndrome, dianfaea, constqpation, gastrointestnial h^nonhage, 

25 acquired immunodeficiency syndrome (AIDS) enteropathy, jaundice, hepatic encephalqpathy, 
hqpatOTBnal syndrome, hepatic steatosis, hemodiroinatosis, Wilson's disease, a]|^ha,-antitiypfim 
deficiency, Reye's syndrome, primary scleiomig cholangitis, liver infiarction, portal vein obstruction 
and ihroinbosis, cenlrilobular necrosis, peliosis hepatis, hepatic vein thrombosis, veno-occlusive 
disease, preeclanopsia, eclanipsia, acute fatty liver of pregnancy, intrahepatic cholestasis of 

30 pregnancy, and hq>atic tumors including nodular hyperplasias, adenoxnas, arid carc The 
polynucleotide sequences encoding DME may be used in Southern or northern analysis, dot blot, or 
other membrane4)ased technologies; in PGR technologies; in dipstick, pin, and multiformat EUSA- . 
like assays; and in microarrays iitiKTing fluids or tissues firom patients to detect altered DME 
expression. Such qualitative or quantitative methods are well known in the art 

35 In a particular aspect, the nucleotide sequences encoding DME may be useM. in assays that 
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detect the presence of associated disQrd^,pa^ The nucleotide 

sequCTces encoding DME niay be labeled by standaid methods and added to a fluid or tissue sanqile 
from a patient und^ conditioEns suitable for tfie fonnation of hybridization cogaq^Iexes. After a 
suitable incubation period, the sample is washed and the signal is quantified and conqpaced with a 

5 standaid value. Iftbeamountof signal in the patimtsanq)le is significantly altered 

a control sample then the presence of altered levels of nucleotide sequmces ^coding DME in the 
saxxqple indicates the presence of the associated disoider. Such assays may also be used to evaluate 
the efiScacy of a particular therq)euttc treatment regimm in animal studies, in clinical trials, or to 
monitor the treatment of an individual patient 

10 In otdsx to provide a basis for the diagnosis of a disorder associated with expression of DME, 

a normal or standard profile for expression is established. Thisrnaybeaccoiiq>Ushedbycoinbining 
body fluids or ceO extracts taken from normal subjects, either animal or human, with a sequence, or a 
fragment tihereof , encoding DME, under conditicxis suitable for hybridization car aiiq>lificati(xi. 
Standard hybridizati(»i may be quantified by comparing the values obtained bom normal subjects 

15 with values from an experiment in which a known amount of a substantially purified polynucleotide 
is used. Standard values obtained in this manuCT may be compared with vduesobtaind 
saixq>lesfixHnpatients who are symptCHnatic for a disorder. Deviation ftom standard values is used to 
establish the presence of a disord^. 

Once the presence of a disorder is established and a treatment protocol is initiated, 

20 hybridization assays may be repeated on a regular basis to determine if the level of expression in the 
patient begins to approximate that wUch is observed in the nonnal subject The results obtained from 
successive assays may be used to show the efficacy of treatm^t over a period ranging fiom several 
days to months. 

With respect to cancer, the jfteseoce of an abnormal amount of transcrqyt (eitber under- or 
25 overexpressed) in biopsied tissue ficxn an individual may indicate A predisposition for the 
devel(q>nfint of the disease, or may provide a rneaiis for detectiri^ die 

of actual clinical syn^toms. A moie definitive diagnosis of this type rnayaUow health pcofessioi^ 
to enyloy preventative measures or aggressive treatment earliear d>errf>y prevoiting tiie develqpment 
or fiirtfa^ progression of the cancer. 

30 Additional diagnostic uses for oligonucleotides designed from die sequences encoding DME 

may involve the use of PGR. These oligomers may be chemically synthesized, genmted 
enzymatically, or produced in vitro . Oli^mers wiU prefierably contain a fragment of a polynucleotide 
encoding DME, or a fragment of a polynucleotide complementary to the polynucleotide encodiog 
DME, and will be employed under optinnzed conditions for identification of a specific gene or 

35 condition. OHgomers nmy also be employed under less stringent conditions for detection or 
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quantification of closely related DNA or RNA sequences. 

hk a particular aspect, oligonucleotide primers derived firom the polynucleotide sequences 
encoding Dl^iiiay be used to detect single nucleotide polymori^^ SNPsare 
substitutions, insertions and deletions that are a frequent cause of inherited or acquired genetic 
5 disease in humans. Methods of SNP detection include, but are not limited to, single^tranded 
conformation polymorphism (SSCP) and fluorescent SSCP (fSSCP) metlKxls. In SSCP, 
oligonucleotide primers derived from the polynucleotide sequraces encoding DMB are used to 
anqdify DNA using the polymerase chain reaction Q'CR TheDNAmay bed^vedyfarexan^le, 
fromdiseasedornormaltissue,biopsysanq>le5, bodily fluids, and the like. SNPs in the DNA cause 

10 differences in the secoaadary and tertiary structures of PGR products in single-stranded fc»m, and 
these differences are detectable using ^1 electrophoresis in non-denaturing g^k^ ]hfiSCCP,the 
oligonucleotide primers are fluorescentiy labeled, in^ch allows detection of the aII^)limers in high- 
throughput equipment such as DNA sequencing machines. Additionally, sequrace database analysis 
methods, termed in silico SNP (isSNP), are capable of identifying polymoiphisms by comparing the 

IS sequence of individual overlapping DNA fragments which assemble into a common consrasus 
sequence. These CQnqputer4)ased methods filt^ out sequence variations dne to lab^ 
of DNA and sequencing errors using statistical models and automated analyses of DNA sequence 
chromatc^grams. In the dteniative, SNPs inay be detected and characterized by inassspectron^ 
using, for example, the hi^ througfq^ut MASSARRAY.system (Sequenom, inc., San Diego CA). 

20 Methods which may also be used to quantify the expression of DME include radiolabeling or 

biotinylating nucleotides, coanoqplification of a control nucleic acid, and interpolating results from 
standard curves. (See, e.g., Melby, P.C et aL (1993) J. Immunol. Mediods 159:235-244; Duplaa, C 
^ aL (1993) Anal. Biochem. -212:229-236.) Hie speed of quantitation of mnltiple $an^>Ies may be 
accelerated by running the assay in a highHhrougjbput format where the oligomer or polynucleotide of 

25 interest is presented in various dilutions and a spectrophotom^c or colorimetric response gives 
rapid quantitatiosL 

hi further embodiments, oligonucleotides or longer fragments derived from any of die 
pdymicleotide sequences described heidn may be used as demer^ Themicroarray 
can be used in transcrqyt mapn^ techniques which mcmitor the relative esqpression levels of large 

30 numbers of genes simultaneously as described below. The ndcroarray may also be used to identify 
genetic variants, imitations, and polymorphisms. This information may be used to determine gene 
fimction, to understand the genetic basis of a disorder, to diagnose a disorder, to monitor 
progression/regression of disease as afimction of gene expression, and to develop and monitor the 
activitiesof therapeutic ag^ts in the treatment of disease. In particular, this information may be used 

35 to develop a pharmacogoiomic profile of a patioat in ord^ to select the most apptopdaSB and 
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effective treatment tegimea for that patient For exanq>le, therapeutic agents which ate hig^ily 
effective and display the fewest side effects may be selected for a patient based on his/her 
pharmacogenomic profile, 

hi another embodimmt, DMB, fragmmts of DME^ or antibodies specific for DMB may be 
S used as elements on a microarray. The microarray may be used to momtor or measure protein-protein 
interactions, drug-taiget interactions, and gene ejqxcession profiles, as described above. 

A particular embodiment relates to the use of the polynucleotides of the present invention to 
gwerate a transcript imag^ of a tissue or cell type. A tcaiiscriptiinage represents the global pattern of 
g^ieexpressioii by a particular tissue or cell type. Global g^ expression patterns are analyzed by 

10 quantii^irig the number of ex|H?essed genes and their rel^ . 
a ^ven time. (See Seilhamer et al., ''Comparative Gme Transcript Analysis/* U.S. Patent No. 
5,840,484, expressly incorporated by reference herein.) Thus a transcript image may be generated by 
hybridizing the polynucleotides of the preset invention or their conq>lements to the totality of 
traiiscripts or reverse tnmscripts of a particular tissue or cell type, hi one embodiment, the 

15 hybridization takes place in higtHtiux>ughput format, wherein the polynucleotides of the present 
invration or then: conq>lements comprise a subset of a plurality of elemmts on a microanay. The 
resultant transcript image would provide a profile of gene activity. 

Transcript images may be generated using transcripts isolated fix)m tissues, cell lines, 
biopsies, or other biological samples. The transcript image may thus reflect gene expression in vivo. 

20 as in the case of a tissue or biopsy sample, or in vitro, as in the case of a cell line. 

Transcript images which profile the expression of the polynucleotides of the present 
invention may also be used in conjunction with in vitro model systems and preclinical evaluation of 
pharmaceuticals, as well as toxicological testing of industrial and naturally-occuning enviromnraital 
con^unds. AUcon^unds induce charactimsticg^expressicm patterns, frequently 

25 molecular fingert«ints or toxicant signatures, which are indicative of mechanisms of action and 
toxicity (Nuwaysir, RF. et aL (1999) MoL Cardnog. 24:153-159; Steincr, S. and Ni. Anderson 
(2000) ToxicoL Lett 1 12-1 13:467-471, ejqxressly mcoqxirated by referaice hcaiem). If a test 
conqx)und has a signature similar to tiiat of a con^und with known toxicity, it is hkely to share 
those toxic properties. These fin^prints or signatures are most useful and refined when they contam 

30 expression information from a large number of genes and gene families. Ideally, a genome-wide 
measurement of expression provides the highest quality signature. Even genes whose expression is 
not altered by any tested compounds are inq)ortant as well, as tiie levels of expression of these genes 
are used to normalize the rest of the e^qnession data. The normalization procedure is useful for 
coiiQ)arison of expression data after treatment with different coii9X)unds. While the assignment of 

35 gene function to elements of a toxicant signature aids in interpretation of toxicity mechanisnos. 
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knowledge of gene function is not necessary for the statistical matching of signatures which leads to 
prediction of toxicity. (See, for example. Press Release 00-02 from the National Institute of 
Environmental Health Sciences, released Fefaruaiy 29, 2000, available at 
http://www.mehs.iiih.gov/oc/news/toxchip.htm.) Therefore, it is important and desirable in 

5 toxicolo^cal screening using toxicant signatures to include all expressed gene sequences. 

la one embodiment, the toxicity of a test conqpound is assessed by treating a biological 
sanq>le containing nucleic acids with the test compound. Nucleic adds that are expressed in the 
treated biological san^le are hybridized with one or more probes specific to the polynucleotides of 
the present invention, so that transcript levels corresponding to tiie polynucleotides of the present 

10 invention may be quantified. The traiiscri^t levels in the treated biolc^cal sample are conq)ared wit^ 
levels in an untreated biological sanq)Ie. Differences in the transcript levels between the two samples 
are indicative of a toxic re^onse caused by the test conqpound in the treated san^le. 

Another particular embodiment relates to the use of tiie polypeptide sequences of the present 
invention to analyze tiiepcoteome of a tissue or cell type. The term proteome refers to the global 

15 patt^ of protein e^qpression in a particular tissue or cell type. Each protein component of a 

proteome can be subjected individually to further analysis. Proteome expression pattmis, or profiles, 
are analyzed by quantifying the nuniber of expressed proteins and their relative abundance under 
given conditions and at a given time. A profile of a cell's proteome may thus be genmted by 
separatiiig and analyzmg the polypq>tides of a particular tissue or ceUty^ In cme embodimmt, the 

20 sq>aration is achieved using two^imraisional gel electrophoresis, in wULcb proteins from a sample are 
separated by isoelectric focusing in the first dimraision, and then according to molecular weight by 
sodium dodecyl sulfate slab g^l electn^hoiesis in the second dimension (Steiner and Anderson, 
supra> . The proteins are visualized in tile gel as discrete and uniquely position 
staining the gel with an agent such as Coomassie Blue or silver or fluorescCTt stains. Hie optical 

25 density of each protein spot is gCTsrally proportional to the level of the protem in the sample. The 
apdcsd densities of equivalentiy positioned protein spots fiomdifierent samples^ for exanqple, from 
biological samples either treated or untreated with a test compcHmd or titer^peutic agent, are 
conipared to identify any changes in protein 5p(^ density related to tte Tbe proteins in the 

spots are partially sequenced using, for example, standard methods enqiloying chemical or enzymatic 

30 cleavage followed by noass spectrometry. The identity of the protein in a spot may be determined by 
conqiaring its partial seqnmce, preferably of at least 5 contiguous amino acid residues, to the 
polypeptide sequences of the present invention, la some cases, further sequence data may be 
obtained for definitive protein identification. 

A proteondc profile may also be generated using antibodies specific for DMB to quantify the 

35 levels of DMB ejqxression. Ihoneembodiiiient, theantit)odiesareusedasekmentsonamicroaiiay. 
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and protein expression levels aie quantified by e:q)osing the microarray to tbe sanq>le and detecting 
the levels of protein bound to each airay element (Lueking, A. et al. (1999) Anal. Biochem. 270:103- 
111; Mendoze, L.G. et aL (1999) Biotechniques 27:77S-788). Detection may be pecfamoed by a 
variety of methods known in the ait, for exanqple, by reacting the proteins in the sanqple with a flnol- 
5 or amino^ieactive fluorescent conq)ound and detecting the amount of fluorescence bound at each 
array element 

Toxicant signatures at ttie proteonoe level are also useful for toxicological screening, and 
should be analyzed in paraUel with toxicant signatures at the transcript level. Tbereisapoor 
conelation between transcript and protein abundances for some protdns in some tissues (Andmon, 
10 NI^ and J. Seilhamer (1997) Electrophoiesis 18:533-537), so proteome toxicant signatures may be 
usefid in the analysis of compounds which do not signiflcantly affect the transcrqpt image, but which 
alter the proteomic profile. Id addition, the analysis of transcripts in body fluids is difGcult, due to 
Tsxpid degradation of niRNA, so proteomic fffofiling may be more reliable and informative in such 
cases. 

15 hi anoth^ embodiment, the toxicity of a test conq»ound is assessed by treating a biological 

sanople containing protems with the test conq>ound. Ptoteins that are esqnressed in the treated 
biological sainple are separated so that the amount of each protein can be quantified Theamountof 
each protein is cQ^^>ared to die amount of the corresponding protein in an untreated biological 
sample. Adifferenceintfaeamountof protein between the two saiiq>Ies is indicative of a 

20 response to the test coiiq)ound in the treated sample, bidividual proteins are identified by sequencing 
the amino acid residues of the individual proteins and comparing these partial sequences to the 
polypeptides of the present invention. 

fid another embodiment, the toxidty of a test compound is assessed by treating a biological 
saiiq)le containing pn)teins with the test compound. Proteins fiom the biological san^le are 

25 incubated with antibodies specific to the polypeptides of the preseiit invention Theamountof 
protein recognized by the antibodies is quantified. Hie ainount of piotdn in tbe treated biological 
sainple is conqwred with the amount in an nn trea fed biological sanyle. AdifferCTceintfaeamountof 
protein between the two san^les is indicative of a toxic response to the test ccmqKHind in die treated 
sample. 

30 Mcroarrays may be prepared, used, and analyzed using inethods known in the art. (See, e.g., 

Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. Acad. Sci. 
USA 93:10614-10619; Baldeschweiler et al. (1995) PCT appUcation W095/2511 16; Shalon, D. et aL 
(1995) per appUcation WO95/35505; HeUer, RA. et al. (1997) Proc. NaU. Acad. Sci. USA 94:2150- 
2155; and Heller, MJ. et al. (1997) U.S. Patent No. 5,605,662.) Various types of microarrays are 

35 well known and thoroughlv described in DNA Microan-avs: A Practical Approach, M. Schena, ed. 
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(1999) Oxford University Press, London, hereby esqoessly incorporated by reference. 

Jn anotfaer embodimmt of the invention, nucleic add sequences encoding DME may be used 

to g^erate hybridization probes useful in m^nn^ Hith^ 
coding or noncodiDg sequences may be used, and in some mstances, noncoding sequences may be 

5 preferable over coding sequences. For exanq>le, conservaticHi of a coding sequence among membm 
of a multi-gene funily may potaitia% cause undesired cross l^facidization during chromosomal 
mappmg. The sequences may be inapped to a particular chromosonie, to a specific 
chromosome, or to artificial chromosome constructions, e.g., human artificial chromosomes (HACs), 
yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial PI 

10 constructions, or single chromosome cDNA libraries. (See, e.g., Hairingtcm, J J. et al. (1997) Nat 
Genet 15:345-355; Price, CM. (1993) Blood Rev. 7:127-134; and Trask, B J. (1991) Trends Genet 
7: 149-154.) Once mapped, the nucleic add sequences of the invention may be used to develop 
genetic linkage maps, for exaiiq>le, which correlate the inheritance of a disease state with the 
inheritance of a particular chromosome region or restriction ftagment Iraogth polymorphism (RFUP). 

15 (See, for example. Lander, E.S. and D. Botstrin (1986) Ptoc. Nafl. Acad Sd. USA 83:7353-7357.) 

Fluorescent in situ hybridization (PSS) may be correlated with other physical and genetic 
map data. (See, e.g., Hdnz-Ulrich, et al. (1995) m Meyers, supra, pp. 965-968.) Exan5)les of genetic 
m^ data can be found in various sdentific journals or at the Qnhne Mendelian Inheritance in Man 
(OMIM) World Wide Web site. Corrdatirai between the location of die gene encoding DME on a 

20 physical isap and a specific disorder, or a predisposition to a specific disorder, may help define the 
region of DNA assodated with that disoixier and thus may furthCT position^ 

fositu hybridization of dnbmosomal prqyaraticns and physical mapping tedm^ues, such as 
linkage analysis using estaMished chrcmosomal makers, may be used for extending ger^c maps. 
Often the placement of a gme on the chromosome of anotfaer mammalian spedes, such as mouse, 

25 may reveal assodatedmaricorsevrai if die exact chromosomal locus is not loD^ TIds information is 
vahiable to investigators seandnng for disease g^iies using posM ' 
techniques. Once the gene or gei^ responsible for a disease or syndrome have been crudely 
localized by genetic linkage to aparticolargen<mic n^on, eg., ataxia-relangieOasia to llq22-23, 
any sequences xoapping to that area may rqxresent associated or regulatory genes for further 

30 mvestigaticMi. (See, e.g., Gatti, ILA. et al. (1988) Nature 336:577-580.) The nucleotide sequence of 
the instant invention may also be used to detect differences in the chromosomal location due to 
translocation, inversion, etc., among normal, carrier, or affected individuals. 

hi anoQier CTobodimiCTt of the invration, DME, its catalytic or immunog^c fragments, or 
oligopeptides thoteof can be used for screening libraries of conqpounds in any of a variety of drug 

35 screwing techniques. The fiagQientenQpIoyed in such screening may be fbse in solution, afiBx 
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solid sii|n>Qit, borne on a cell surface, or located intracellularly. The formaticMi of bindiiig complexes 
between DME and the agent being tested may be measured. 

Anotb^ technique for drug scremiing provides forbig^ 1]irouglq>nt screening of conq^unds 
having suitable bmding afBnity to the piotem of interest (See, e.g., Geysen, et aL (1984) PCT 
5 application WO84/03564.) hi this method, large nunibers of different smaU test confounds a^ 
synthesized on a solid substrate. The test con9)ounds are reacted with DME^cxr fragments thereof, 
and washed. BoundDMEis thendetectedby methods well known in the art Purified DME can also 
be coated directly onto plates fw use in tibe aforementioned drug screening techniques. Altraoatively , 
non-neutralizing antibodies can be used to capture the peptide and immobilize it on a solid support 
10 In another embodunent, one may use conq)etitive drug soe^ung assays in which neutralizing 

antibodies enable of biiiding DME specificanycoiiqyete with a test conqp^ In 
this msaoDsCy antibodies can be used to detect the presence of any pqptide which shares one or more 
antigqadc detemunants with DME. 

hi additional embodiments , the nucleotide sequences which encode DME may be used in any 
IS molecular biology techniques that have yet to be developed, provided the new techniques rely on 
properties of nucleotide sequences that are currently known, including, but not limited to, such 
prcq)erties as the triplet graaetic code and specific base pair interactions. 

Without further elaboration, it is believed that one skilled in the art can, usmg die preceding 
descrq>tion, utilize the present mvention to its fullest extmt The following embodimmts are, 
20 therefiore, to be construed as merely illustrative, and not Gmitative of the remainder of the disclosure 
in any way whatsoever. 

The disclosures of all patents; applications, and publicaticxos. mentioned above and below, 
inchMfing U.S. Ser. No. 6QG54308, U.S. Ser. No. 60/256^189, U.S. Set. No. 60057,713, U.S. Ser. 
No. 60/262,706, and U.S. Ser. No. 60^,020, are hereby expressly mcorpoorated by refisr^ce. 

25 

EXAMPLES 

L Qnistniction of d>NA libraries 

Incyte d>NAs were denved fiom d>NA Ehraries desciiM 
(IncyteGenoniics,PaloAItoCA). Some tissues wm homogenized and lysed in guanidinium 

30 isothiocy anate, while others were homogenized ^aad lysed m phraol or in a suitable mixture of 
denaturants, such as TRIZOL (Life Technologies), a monqphasic solution of phenol and guanidine 
isothiocyanate. The resulting lysates wore centrifuged over C^QcusMons or mtracted with 
chloroform. RNA was precipitated from the ly sates with eidierisopropanol or 
ethanol, or by other routine methods. 

35 Phenol extraction and precq)itation of RNA were rq^eated as necessary to increase RNA 
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purity, b some cases, RNA was treated with DNase. For most libraries, poly(A>f SNA was isolated 
using oligo d(T>coupled paramagnetic particles (Promega), OUGOTEX latex particles (QIAGBN, 
C3iatswordi CA), or an OUGOTEX mRNA purification kit (QIAGEN). Alteiiiatively, RNA was 
isolated directiy from tissue lysates using other RNA isolation kits, e.g., the POLY(A)PURE niRNA 
5 purificaticm kit (Ambion, Austin TX). 

b some cases, Stratagene was provided with RNA and constructed the corresponding cDNA 
libraries. Otberwise,cDNA was synthesized arid cDNA libraries were constnictedwi 
vector system (Strata^e) or SUPERSCRIPT plasmid system (Life Technologies), using die 
recomriiendedpnx:edures or similar inethods known in the art (See, e.g.> AusubeL 1997. supra, units 

10 S. 1-6.6.) Reverse transcripticm was initiated using oUgod(T) or random primers. Synthetic 

oligonucleotide ad^ters were ligated to double stranded cDNA, and the cDN A was di^sted with the 
qyprppriate restriction enzymie or enzymes. For most lilsaries, the cDNA was size-selected (300- 
1000 bp) using SEFHACRYL SIOOO, SBPHAROSE CL2B, ot SEPHAROSB CM^B column 
chromatography (AinershamPharinada Biotech) or preparative agarose ^lelec cDNAs 

15 were ligated into compatible restriction enzyme sites of the polylinker of a suitable plasmid, e.g., 
PBLUESCRIPT plasmid (Stratagene), PSPORTl plasmid QUUTe Technologies), PCDNA2.1 plasmid 
(hivitrogra, Carlsbad CA), PBK-CMV plasmid (Stratagme), PCR2-TOPOTA plasmid Qhvitrogen), 
PCMV-iaS plasmid (Stratagene), pIGEN Oocyte Genomics, Palo Alto CA), pRARB (bcyte 
Genomics), or pINCY (Incyte Genomics), or derivatives tiiereof. Recombinant plasmids wm 

20 transformed into conspebsnt B.coli cells including XLl-Blue, XLl-BlueMRF, or SOLR from 
Stratag»ie <^ DHSo, DHIOB, or ElectroMAX DHIOB fiom life Technologies. 
IL Isolation of cDNA Cloiies 

Plasmids obtained as described in Example I were recovered from host cells by in vivo 
^ccirion using the IINI21AP vector 5y5tCTi(Stnitagaie) or by ceUlysi^ Plasmids w^e purified usii^ 

25 at least one of the following: a M^c or WIZARD ^finipieps DNA purification system OPtomega); an 
AGTC NGniprqp purificaticm kit (Edge Biosystems, Gaittiersburg MD); and QIAWELL 8 Plasmid, 
QIAWELL 8 PlusPlasnnd, QIAWELL 8 Una Plasnnd purification systmis cr the R£AX. PREP 96 
plasinid purification kit firom QIAGEN. Following precq^tati(Hi,plasinids were resuspended in 0.1 
ml of distilled water and stored, with or without lyophilization, at 4 ''C 

30 Atematively, plasmid DNA was amplified from host cell lysates using direct link PGR in a 

high-througtvut format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Ifost cell lysis and thermal 
cycling steps were carried out in a single reaction mixture. Sanq>les were processed and stored in 
384-well plates, and the concentration of msplihed plasmid DNA was quantified fiuorometrically 
usmg nCOGREEN dye (Molecular Probes , Eu^aie OR) and a FLUOROSKAN H fluorescaacc 

35 scanner (Labsystems Oy, Helsinki, Finland). 
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m. Sequendng and Analysis 

Mcyte cDNA lecovered in plasmids as desoribed in Exanqile n were segaenced as follows. 
Sequencing reactions were processed using standard methods cor Ugh-tbroughput instrumentation 
such as the ABI CATALYST 800 (Applied Biosystems) diecmal cycler or the PTC-200 thermal 

5 cycler (MI Research) in conjunction with the HYDRA microdisp^oser (Robbins Scientific) or the 
MICROLAB 2200 CBiamilton) liquid transfer system. dDNA sequraicmg reactions were prepared 
using reagents provided by Amersbam Pharmacia Biotech or supplied in ABI sequendng kits such as 
the ABI FEOSM BIGDYE Terminate cycle sequracmg ready reaction kit (Ajiplied Biosystems). 
Electrophoietic separation of cDNA sequencing reacticms and detection of labeled polynucleotides 

10 were carded out using the MEGABACB 1000 DNA sequencing system (Molecular Dynamics); the 
ABI PRISM 373 or 377 sequencing system (Applied Biosystems) in conjunction with standard ABI 
protocols and base calling software; or other sequence analysis systems known in the art Reading 
frames within the cDNA sequences w^ identified using standard methods (reviewed in Ausubel, 
1997, supra, unit 7.7). Some of the cDNA sequences were selected for extension using the techniques 

15 disclosed in ExaIE^>leVIIL 

The polynucleotide sequences derived fix>m Incyte cDNAs were validated by removing 
vector, linker, and poly(A) sequences and by masking ambiguous bases, using algorithms and 
programs based on BLAST, dynamic programming, and dinucleotide nearest neighbor analysis. The 
Incyte cDNA sequences or tcanslaticms thereof were then queried against a selection of public 

20 databases such as the GeuBank primate, rodent, mammalian, vertebrate, and eukaryote databases, and 
BLOCKS, PRINTS, DOMO, WIODOM; PROTEOMB databases with sequences ftom Homo sapims, 
Rattus norvegicus. Mus musculus, Caenodiabditis elepans, Saccharomvces cerevisiae, 
Schizosaccharomvces pombe. and Candida albicans flhcvte Goiomics, Palo Alto CA); and Mddm 
Markov inodel(HMN^ba5ed protein £umly databases such as ITA^ (HMM is a probabilistic 

25 approach which analyzes consCTSUs primary structures of gene femilies. See, for example, Eddy, 
S JL (1996) Curr. Opin. Struct BioL 6:361-365.) The quedes were p^ormed using programs based 
cm BLAST, FASTA,BLINIPS, and HMMER. The hocytecDNA sequences were assembled to 
produce full teoglfa polynucleotide sequences. Ahematively , GenBank cDNAs, GaiBank ESTs, 
stitched sequences, stretched sequences, or Genscan-predicted coding sequraces (see Examples IV 

30 and V) were used to extend hicyte cDNA assemblages to full length. Assembly was performed using 
programs based on Phied, Pfarqi, and Consed, and cDNA assemblages were screened for open 
reading franies using programs based on GeneMark, BLAST, and FACT The full length 
polynucleotide sequences were translated to derive the corresponding fiill length polyp^tide 
sequences. Alternatively, a polypeptide of the invention may begin at any of the methionine residues 

35 of the full Imgth translated polyp^tide. Pull length polyp^tide sequences w^subsequenfly 
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analyzed by querying against databases such as the GenBankprotem databases (geiqiept), SwissProt, 
the FRaiEOME databases, BLOCKS, PRINTS, DOMO, PRODOM, Ptosite, and hiddm Markov 
model CHNIM)-based protem faxnily databases such as PFAM. Full length polynucleotide sequraces 
aie also analyzed usmg MACDNASIS PRO software (Hitachi Software Engmeering, South San 

5 Francisco CA) and LASERGBNE software (DNASTAR). Polynucleotide and polypq)tide sequence 
aligmnmts are generated using de&ult parameters specified by the CLUSTAL algorithm as 
incorporated into the MBGALIGN multisequ^ace alignment program (DNASTAR), which also 
calculates the pment idratity between aligned sequ^ces. 

Table 7 summarizes the tools, programs, and algorithms used for the analysis and assembly of 

10 Inqrte cDNA and fiill Imgth sequ^oces and provides ^licable descriptions, references, and 

threshold parameters. The first column of Table 7 shows the tools, programs, and algorithms used, 
the second column provides brief descriptions theceof, the third column presents appropriate 
references, all of which are incorporated by refer^ice herein in their entirety, and the fourth column 
presents, where applicable, the scores, probability values, and other paramet^ used to evaluate the 

15 strength of a match betwera two sequences (the higher the score or the lower the probability value, 
the greater the identity between two sequmces). 

The programs described above for the assembly and analysis of fiill length polynucleotide 
and polypeptide sequences were also used to identify polynucleotide sequence firagments fiom SEQ 
ID NO: 14-26. Hragmmts from about 20 to about 4000 nucleotides which are useful in hybridization 

20 and anq)lification technolp^es are described in Table 4, cotunm 2. * 

IV, Identificatioii and Edifiiig of Coding Sequences frimi Genomic DNA 

Putative drug metabolizing enqmoes were initially id^tified by running the Genscan gene 
identification prOgraTn ag^i«g^ pnhHc gmnmic seqiigaice databases (e>g., gbpri and gbhtg). Grascanis 
a gra^ral-purpose gene identification program which analyzes genomic DNA sequences from a 

25 variety of organisms (See Burge, C and S. Kariin (1997) J. MoL BioL 268:78-94, and Burgo, C. and 
S. Kariin (1998) Curr. Opin. Struct BioL 8:346-354). The program concat^iates predicted exons to 
.fcmn an asseinbledcDNAsequmce extending from a metUooine to a stop cod^ Theoutputof 
Genscan is a FASTA database of polynucleotide and polyp^de sequences The maximum range of 
sequrace for Genscan to analyze at once was set to 30 U>. To detmnine which of these Genscan 

30 predicted cDNA sequences racode drug metabolizing enzymes, the wooded polypeptides were 
analyzed by querying against PFAM models for drug metabolizing enzymes. Potential drug 
metabolizing razymes were also identified by homology to Ihcyte cDNA sequences that had been 
annotated as drug metabolizing enzymes. These selected Genscan-predicted sequences w^ then 
compared by BLAST analysis to the g^p^t and gbpri public databases. Where necessary, the 

35 G^DLScan-piedicted sequ^ces were then edited by comparison to the top BLAST hit from geaipept to 
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CQnect ertois in the sequence predicted by G^iisca^ BLAST analysis 

was also used to find any Iticyte cDNA or public cDNA cov^age of the Genscan-fnedicted 
sequences, thus providing evidence for tcansciiption. When Ihcyte cDNA coverage was available, 
this inforniation was used to correct or confiim the Genscan predicted se^ FuUl^gth 
5 polynucleotide sequences wm obtained by assembling Genscan-predicted codmg sequences with 
hicyte cDNA sequences and/or public cDNA sequraces using the assembly process described in 
Exanq)le IIL Alternatively, full length polynucleotide sequences wm derived entirely bom edited or 
unedited Genscan-piedicted coding sequences. 

V. Assembly of Genomic Sequence Data wifli cDNA Sequence Data 

10 "Stitched" Seqoenccs 

Partial cDNA sequences wctc extmded with exons predicted by the Geoscan gene 
identification program described in Example IV. Partial cDNAs assembled as described in Exanq>le 
ID were mapped to geuomic DNA and parsed into clusters containing related cDNAs and Genscan 
exon predictions fiom one or more genomic sequences. Each cluster was analyzed using an algorithm 

15 based on gr£q>h tiheory and dynamic progranGmoing to integrate cDNA and genomic inf otmation, 
generating possible splice variants that were subsequently confirmed, edited, or extended to create a 
full length sequrace. Sequencemtervdsinwhichtheentirelengthof the interval was present on 
more than one sequence in the cluster were identified, and intervals thus id^tified were considered to 
be equivalent by transitivity. For example, if an interval was present on a cDNA and two g^ornic 

20 sequmces, then all three intervals weie considered to be equivalent This process allows mnelated 
but consecutive genomic sequences to be brou^t togetfa^, bridged by cDNA sequence. Intervals 
thus id^tified were then **stitched'* together by the stitching algorithm in die order that they ^ypear 
along their parent sequoices to generate die Icm^st possible sequeiKy>, as well as sequKice variants. 
Linkages betwe^ intervals which proceed along one type of parent sequence (cDNA to cDNA or 

25 genomic sequence to genomic sequence) wctq given preference over linkages which change parent 
type (cDNA to genomic sequence). The resultant stitched sequences were translated and compared 
by BLAST analysis to tihe genp^ and gbpri public databases . hicorrec^ exons predicted by Geaiscan 
were conected by conqiaiiscm to the tqp BLAST fait fi^ Sequences wane further extended 

widi additional cDNA seqwnces, or by inspection of genomic DNA, when necessary. 

30 **Stretchctf ' Sequences 

Partial DNA sequraces were extended to full length with an algorithm based on BLAST 
analysis. Krst, partial cDNAs assembled as described in ExBxnple M were qu^ed against public 
databases such as the GenBank primate, rodent, TnamTpAliaTi, vertebrate, and eukaryote databases 
usmg the BLAST program. ThenearestGenBankproteinhoinolog was then cQnq)arEd by BLAST 
* 35 analysis to either McytecDNAsequraices or GenScanexon predicted sequ^cesdesoibe^ 
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ExaiiQ>leIV. A chuneric protein was gene]:ated by 

(HSPs) to map the translated sequences onto the GeoBank protein homolog. fiasertions or deletions 
may occur in the chimedc protein with lespect tote The 
GenB ank protein homolog, the chimeric protein, or both were used as probes to search for 

5 homologous genonucsequmresfirom the public hunm genome databases PWial DNA sequences 
were thmfore "sdretched" or ^itended by the addition of homologous genomic sequences. The 
resultant stretched sequences were examined to detmnine whether it contained a conq>lete gene. 
VL Oiromosomal Mapping of DME Encoding Polynudeolides 

The sequences which were used to assemble SEQ ID NO:14-26 were conq>ared with 

1 0 sequraces fiom tibe Incite LIFBSEQ database and public domain databases using BLAST and other 
inoyplementations of the Smith-Waterman algorithm. Sequences from these databases that matched 
SEQ ID NO: 14-26 were assembled into clusters of contiguous and overlapping sequences using 
assembly algorithms such as Phr^ (Table 7). Radiation hybrid and genetic m^ing data available 
fitnn puUic resources such as the Stanford Human Gcacam Center (SHGQ, Whitehead Institnte for 

15 Genome Research (WIGR), and G^fthon weie used to detemune if any of the clustraed sequences 
had been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the assigninent 
of all sequences of that chister, including its particular SEQ ID NO:, to diat map location. 

locations are represented by Tanges, or intervals, of human chromosooies. Themap 
position of an interval, in centiMorgans, is measured relative to die terminus of the dnxxmosome' s p- 

20 arm. (ThecentiMorgan (cM) is a unit of measurement based on recombination fiequmdes b^een 
chromosomal markers. On average, IcM is roughly eqmvalent to linegabase (Mb) of DNA in 
hu]nans,althoughthiscanvary widely due to hot and coUqpots of recon^ ThecM 
distances are based on gen^ic inadoen imqyped by G&idfacm wU 

liybridinarkers whose sequences were included in each of the clusters. Human genmne maps and 
25 odior resources available to the public, such as the NCBI "GeDeMap'99" World Wide Web site 
Oittp://www jictnj)bn.nih.gov/genei^^ can be eDq>loyed to determine if previously identified 
disease gqnesmq) within or in proximity to die intenrals indicated abo Id this manner, SEQ ID 
NO: IS was mqyped to dnomosome 10 witiun die interval fitom 13 1.90 to 134.70 centiMorgans. 
Vn. Analysis of Polynudeolide Expression 
30 Northern analysis is a laboiatory technique used to detect die presence of a transcript of a 

g^e and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs 
from a particular ceU type or tissue have been bound. (See, e.g., Sambrook, supra, ch. 7; Ausnbel 
(1995) supra, ch. 4 and 16.) 

Analogous conq>uter techniques ^plyiogBLACT weie used to search for identical or related 
35 molecules in cDNA databases such as GenBankorLlFESEQ Ohcyte Genonncs). This analysis is 
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much faster than multiple menabiane-based hybridizadons. hi additian^ the sensitivity of the 
computer search can he modified to detomme whether any particular match is categorized as exact or 
sunilar. The basis of the search is the product score, which is defined as: 

5 BLAST Score x Percent Identity 

5 X midmum {length(Seq. 1), lengtii(Seq. 2)} 

The product score takes mto account both tiie degree of sinnlarity between two sequences and the 
length of the sequence match. The product score is a normalized value between 0 and 100, and is 

10 calculated as follows: the BIJ^T score is niultq>Ued by the peai^ent nucleotide idmtity aM 

productisdividedby (5 tin^ the length ofthe shorter of the two sequences). The BLAST score is 
calculated by assigning a score of +5 for ev^ base that matches in a high-scoring segment pair 
(HSP), and -4 for every mismatch. Two sequaices may share more than one EBSP (separated by 
gaps). If there is more than cmeHSP, then the pak with the hi^est BLAST score i^ 

15 the product score. The product score represents a balance between fiacticnaloverhp and quaKty in a 
BLAST alignm^. For exan^le, a productscoreof 100 is produced only for 100% identity over the 
entire length of the shorter of the two sequences being compared. A product score of 70 is produced 
either by 100% identity and 70% overlap at one end, or by 88% identity and 100% overlap at the 
other. A product sccxre of SO is produced eith^ by 100% identity and 50% overlap at one end, (kt 79% 

20 ideatity and 100% overly. 

Alternatively, polynucleotide sequences »codmg DME are analyzed with respect to the 
tissue sources from which they were derived For example, some full Imgth sequences are 
assend>led, at least in part, with overi^q[riiigiK^ EachcDNA 
sequence is derived from a cDNA library constrwrted from a human tissue. Each human tissue is 

25 classified into one of the following offgan/tissue categories: cardiovascular system; connective tissue; 
digestive system; einbryonic structures; endocrine system; exocrii^ ^ands; g^talia, female; 
g^utalia, male; gexm cells; hemic and mwnnnft system; livo; musculoskeletal system; nervous 
syston; pancreas; respiratory system; sense organs; skin; stomatognatfaic system; unclassified/mixed; 
en* urinary tract Thennmberof libraries in each category is counted and divided by the total nu^ 

30 of libraries across all categories. Sinnlarly , each human tissue is classified into one of the following 
disease/condition categories: cancer, cell line, developmimtal, inflammation, neurological, trauma, 
cardiovascular, pooled, and other, and the number of libraries in each category is counted and divided 
by the tc^ number of libraries across all categories. The resulting percentages reflect the tissue- and 
disease-specific expression of cDNA encoding DME. cDNA sequm:es and cDNA library/tissue 

35 information are found in the LIFESBQ GOLD database (hicyte Genomics, Palo Alto CA). 
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Vni. Extoiffl n fDME Encoding Polynadeotides 

Full length polyancleodde sequences wece also produced by ^ctension of an appropriate 
fragment of the full length nK>lecule using oUgonucleotide One 
primer was synthesized to initiate 5 ' extension of the known firagment, and the olh^ primer was 
5 synthesized to initiate 3* ^Eteusion of the known fragment. The initial primers were designed using 
OLIGO 4.06 software (National Biosciences), or anoth^ appropriate program, to be about 22 to 30 
nucleotides in length, to have a GC contrat of about 50% or more, and to anneal to the target 
sequence at temperatures of about 68''C to about 72^C Any stretch of nucleotides which would 
result in hairpin structures and primer-prim^ dirmizations was avoided. 

10 Selected human cDNA libraries weie used to extend the sequence. Ifmorethanone 

extension was necessary or desired, additional or nested sets of primers were designed 

High fidetityamplificatim was obtamed by PCK using methods weU PGR 
was perfonned in 96-weU plates using the FrC-200the]iDal cycler (MI Resean:h^ Thereaction 
mix contained DNA tenqilate, 200 nmol of each primer, reaction buffer containing Mg^% QiH^SO^ 

15 and 2-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), ELONGASE enzyme 
(Life Technologies), and Pfii DNA polymerase (Stratagene), with the following parameters for primer 
pair pa A and pa B: Step 1: 94^C 3 mm; Step 2: 94**C, 15 sec; Step 3: 60**C 1 mm; Step 4: 68*C 
2rniiuStep5: Stqps2,3,and4repeated20times;Step6:68''QSinin;Step7: stxxageat4''C Lithe 
alternative, the parameters for primn pair T7 and SK+ were as follows: Step 1 : 94 *^ C, 3 min; Step 2: 

20 94''C IS sec; Step 3: ST'C, 1 min; Step 4: dS'^C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; 
Step 6: eS^'C, 5 min; Step 7: storage at 4''C 

The concentration of DNA in each well was determined by dispensing 100 fil PICOGREEN 
quantitation reagmt (0.25% (v/v) PIOXatEEN; Molecular Ptobes, Eugene OR) dissolved in IX TE 
and 0.5 ^lof undiluted PCR product into each weU of an qpaquefhioriineterpla^ 

25 Actm MA), allowing the DNA to bind to the reagent The plate was scanned in a FhiorosiQnll 

(f ■aKyyjfft'nryf PUftlgmVi^ Fii^lgiivl) tn mftngnm thft finrmsceancei of the sample arid tn qnanrify the 

concentratiQnofDNA. A 5 /il to 10 Ail afiquot of the reaction nnxtuce was analyzed by 

electrophoresis cn a 1 % agarose g^ to d^eimiiie ^rfddi reactions we^ 

sequence. 

30 The exti^ed nucleotides were desalted and concentrated, transferred to 384-well plates, 

digested with CMJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and 
sonicated or sheared prior to religption into pUC 18 vector (Amersham Pharmacia Biotech). For 
shotgun sequeiK:ing, fhe digested nucleotides were separated on low concentraticHi (0.6 to 0.8%) 
agarose gels, fragments were excised, and agar digested with Agar AGE (Promega). Extended clones 

35 were religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 1 8 vector (Amersham 
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Phammcia Biotech), treated with Pfii DNA polymerase (Stratagene) to fill-in restiictim site 
overfiangs, and transfected into conyetCTt R coli cells. Transfcnxned cells were selected on 
antibiotic-containing media, and individual colonies were picked and cultured overnight at ST'C in 
384-well plates in LB/2x carb liquid media. 
5 The cells were ly sed, and DNA was aixq[>lified by PGR using Taq DNA polymerase 

(Amersham Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following 
parameters: Step 1: 94''C, 3 mm; Step 2: 94°C; 15 sec; Step 3: eO'^C, 1 min; Stq? 4: 72'*C, 2 min; 
Step S: steps 2, 3, and 4 repeated 29 times; Step 6: 72**C, S min; Step 7: storage at 4''C DNA was 
quantified by PICX)GREEN reagent (Molecular Probes) as described above. Sanoples with low DNA 

10 recoveries were rean^lified using the same conditions as described above. Samples were diluted 
with 20% dimethysulfoxide (1:2, v/v), and sequenced using DYENAMIC energy transfer sequencing 
primers and the DYENAMIC DIRECT kit (Ammham Pharmacia Biotech) or the ABI PRISM 
BICE>YE Tenninatar cycle sequracing ready reacticm Idt (Af^lied Biosystems). 

in like manner, fiill length polynucleotide sequences are v^ified using the above procedure or 

IS are used to obtain 5* regulatory sequences using the above procedure along with oligonucleotides 
designed for such extension, and an q>propriate genomic library. 
DL LabeKng and Use of Individual Hybridization Pftibes 

Hybridization probes derived fiomSEQ ID NO:14-26 are enq[)loyed to screen cDNAs, 
gnomic DNAs, or nRNAs. Although the labeling of oUgpnucleotides, crasisting of about 20 base 

20 pairs, is spedficaUy described, essraitially die same procedure is used with larger nucleotide 
firagmimts. OUgonucleotides are designed using state<>f-the-art software such as OLIGO 4.06 
software (National Biosciences) and labeled by combining 50 pmol of each oligom^, 250 ^Ci of 
[y.32p] adenosine trq)hosphate (Amersham Pharmacia Bic^ech), and T4 polynucleotide kinase 
(DuPont NEN, Boston MA). The labeled oligonucleotides are substantially purified using a 

25 SH^HADEX G-2S superfine size exclusion dextran bead column (Amersham Pharmacia Biotech). 
An aliquot containing 10'' counts per minute of the labeled probe is used in a typical membrane-based 
hybridization analyds of human genomic DNA digested with one of the followiiig endonndeases: 
Ase I, BgJ It Eco RI, Pst t Xba I, or Pvu n (DuPrat NEN). 

Tte DNA from each digest is fractionated cm a 0.7% agarose gel and transferred to nylon 

30 membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16 
hours at 40°C. To remove nonspecific signals, blots are sequentially washed at room tenoqperature 
under conditions of up to, for exanq)le, 0.1 x saline sodium citrate and 0.5% sodium dodecyl sulfate. 
Hybridization patterns are visualized using autoradiography or an alternative imaghig means and 
compared. 

35 X. Microarrays 
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The linkage or synthesis of airay elements upon a nuCToarray can be achieved ntilizmg 
photolithography, piezoelectric printing (ink-jet printing, See, e.g., Baldeschweil^, snpta. \ 
inechanical miciospotdng technologies, aiui derivatives thei^ The substrate in each of the 
aforementioned technologies should be uniform and solid with a non-porous surface (Schena (1999), 
5 supraV Sugg^ed substrates include siUcoD, siUca, glass sUdes, glass chips, and siUcon waf 
Alternatively, a procedure analogous to a dot or slot blot may also be used to arrange and link 
elements to the surface of a substrate using thermal, UV, chemical, or mechanical bonding 
procedures. A typical array may be produced using available methods and machines well known to 
those of cmlinarysIdU in the art and may contain any (See,e.g., 

10 Schraa, M- et al. (1995) Science 270:467-470; Shalon, D. et al. (1996) Genome Res. 6:639-645; 
Marshall, A. and J. Hodgson (1998) Nat Biotechnol. 16:27-31.) 

Rdl length cDNAs, E3q[Hiessed Sequence Tags (ESTs), or ftagm&nts or oligomers thereof may 
conqprise the elem^ts of the microairay . Fragments or oligomers suitable for hybridization can be 
selected using software weU known in the art such as LASERGENEsof^ The 

15 array elements are hybridized with polynucleotides in a biological sample. The polynucleotides in the 
biological sample are conjugated to a fhioiescent label or other molecular tag for ease of detecticHi. 
After hybridization, nonhybridized nucleotides from the biological sample are removed, and a 
fluorescence scanner is used to detect hybridization at each array element Alternatively, laser 
desorbtion and inassspectromi^ry may be used for detection of hybridization. Thedegreeof 

20 complementarity and die relative abundance of each polynucleotide which hybridizes to an elemoot 
on the microarray may be assessed. Lioiieeinlxxliment,microarray preparation and usage is 
described in d^ail below. 
Tissue or Cdl Sft wpl*^ l^^rg paration 

Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and 

25 poIy(A)* RNA is purified using the oligo-(dT) cellulose method. Each poly(A)^ RNA sample is 
reverse transcribed usiiigMNfLVreveise-transcriptase, 0.05 pE^^ IX 
first strand buffer, 0.03 umts/^l RNase inhitntor, 500 |iM dATP, 500 fiM dOIP, 500 fiM dTTP, 40 
fM dCTP, 40 pM dCTP-Cy3 (BPS) or dCTP-Cy5 (Ammham Pharmacia Biotech). Ilieieverse 
tnmscriptian reaction is perfonned in a 25 ml volume contaiiK^ 

30 GEMBRI(jHTkits(Incyte). Specific control poly(A)^RNAs are synthesized by mistranscription 
fiom non-coding yeast genomic DNA. After incubation at 37** C for 2 hr, each reaction sample (one 
with Cy3 and another with CyS labeling) is treated widi 2.5 nd of 0.5M sodium hydroxide and 
incubated for 20 minutes at 85''C to the stop die reaction aiid degrade die RNA. San^les are purified 
using two successive CHROMA SPIN 30 gel filtration spm columns (CXONTECH Laboratories, Ihic. 

35 (CXONTEdQ, Palo Alto CA) and after combining, both reaction sanq>les are ^hanol precipitated 
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using 1 ml ofgtycQgen(lii^mlX 60 xnl sodium ac^ The saiiq>le is 

then dried to conq>letion nsing a SpeedVAC (Savant Instnunents Inc., Holhrook NY) and 
lesuspended m 14 fd SX SSaO.2% SDS. 
Microarrav Preparation 
5 Sequencesof the preset invention aie used to generate anay elements. Each array element 

is anq>lified fiom bact^nal cells containing vectors with cloned cDNA inserts. PGR amplification 
uses primers complementary to the vector sequences flanking die cDNA insert Array elements are 
amplified in thirty cycles of PGR from an initial quantity of 1 -2 ng to a final quantity greater tlian 5 
fig. Anq>lified array elen^nts are then purified using SEPHAORYLr^^ 
10 Biotech). 

Purified array elements are irnmobilized on polyiner-coated glass slides. Glass microscope 
slides (Coming) are cleaned by ultrasound in 0.1% SDS and acetone, with extensive distilled water 
washes between and after treatments. Glass sfides are etched in 4% hydrofluoric add (VWR 
Scientific Products Corporation (VWR), West Chester PA), washed extensively in distilled water, 
15 and coated with 0.05% ammopropyl silane (Sigma) in 95% ethanol. Coated slides are cured in a 
IWCoven. 

Anay elements are applied to the coated glass substrate using a procedure described in U.S. 
Pat^t No. 5,807,522, incorporated h^idn by reference. 1 fd of tte array element DNA, at an average 
concentration of 100 ng^^ is loaded into the open capillary printing elsDoent by a high-speed robotic 

20 i^paratus. The apparatus then d^sits about 5 nl of array eleriientsainple per slide. 

MicToarrays are UV-crossUnted using a STRATALINKERTJV-crosslinker (Stratagene). 
Microarrays aro washed at room temperature once in 0.2% SDS and tinee times m 
Non-specific Irinding sites are blocked by incubation of microarray s in 0.2% casein in jAosphate 
buffered saline (PBS) (Tropix, Ina, Bedford MA) fi»r 30 minutes at W C followed by washes m 

25 0.2% SDS and distilled water as before. 
Hybridization 

Hybodization reactions contain 9 fd of san^le ndxtme consisting of 0.2 lig each of Cy3 and 
CySlabdedcDNAsynAesis products in 5XSSC, 0.2% SDS hybridization buffer Thesanqple 
mixture is heated to 65 C f or 5 minutes and is aliquoted onto the microarray surface and covered 

30 with an 1.8 cm^ coverslip. The arrays are transferred to a wateqnroof chamber having a cavity just 
slightiy larger than a microscope slide. The chaniber is kept at 100% humidity inteinally by the 
addition of 140^ ofSXSSC in a comer of the chamber. The chamber containing the arrays is 
incubated for about 6^ hours at 60*^0 Thearraysare washed for 10 rnm at 45"* C ina finst wash 
buffer (IX SSC, 0.1% SDS), tinee times for 10 minutes each at 45''C in a second wash buffer (O.IX 

35 SSC), and dried. 
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Detection 

Repaiter-labded hybridizadon conqylexes aie detected with a microscope equipped widi an 
Imova 70 mixed gas 1 0 W laser (CoheieDt, Inc., Santa Qaia CA) cs^able of genmting spectral lines 
at 488 nm for excitation of CyS and at 632 nm for excitation of CyS. The excitation lasCT Ijght is 

5 focused on die anay using a 20X microscope objective (Nikon, Lic^^ The slide 

containing the array is placed on a computer-conttolled X-Y stage on the microscope and raster- 
scanned past the objective. The 1.8 cm x 1.8 cm array used in fhe present example, is scanned with a 
resolution of 20 micrometm. 

In two separate scans, a mixed gas multiline laser excites the two fluorophoces sequmtiaUy . 

10 Emitted light is split, based on wavelengOi, into two photomultiplier tube detectors (PMT R1477, 
Hamamatsu Photonics Systems, Bridgewater NJ) corresponding to the two flutHophores. Ap[Ht>priate 
filters positioned between the array and the photomuMpHer tubes are used to filter The 
ftmigfiifm mfl TtTna of llift fliinmphnfrpjt iiseil are trni for Hyj^ and 650 nm for Cy5. Each array is 
tyjdcally scanned twice, one scan per fluorophore usmg die appropriate filters at the laser source, 

15 although die ^>paiatu5 is capable of recording the spectra fipom both fluorophores simultaneously. 

The sen^tivity of the scans is typically calibrated using the signal intensity gqaerated by a 
cDNA control qiedes added to the sainple mixture at a Imown concents A specific location on 
the anay contains a complemrailary DNA sequence, allowing die intensity of the signal at that 
location to be coirelated with a wdght ratio of hybridizing species of 1:100,000. When two sanqiles 

20 from different sources (e.g., representing test and control cells), each labeled with a differrait 
fluorophore, are hybridized to a single array for the purpose of identifying gmes diat are 
differaitiany expressed, the caEbsration is done by labeling samples of the calibrating cDNA with die 
two fluon^iores and adding identical amounts <rf eadi to the lybridization mixtme. 

The output of the photomultiplier tube is digitized using a 1 2-l»t RTI-835H analog-to-digital 

25 (A/D) convmion board (Analog Devices, Ina , Norwood MA) installed in an IBM-conqpatible PC 
computer. Tte digitized data are displayed as an image vfbessi the signal intensity is m^[>ped imng a 
linear 20-coloartransfoiinatio8i to a psradocolor scale laii^giilg from hhie Qow signal) to red O^S^ 
signal). The data is also analyzed quantitatively. Where two different fhiosophosesaieesudted and 
measured simultaneously, the data aie first corrected fior optical crosstalk (due to ovedapping 

30 emission spectra) betwera the fluorophores using each fluorophorB*s ^ssion spectrum. 

A grid is siqieriuaqx)sed over the fluorescence signal image such that the signal from each 
spot is centered in each element of die grid. The fluorescence signal widun each element is then 
int^iated to obtain a numerical value correspondii^ to the average intei^^ The 
software used for signd anatysis is the CffiNfT(X)IJS gene expression 
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XI. Complementary Polynndeotides 

Sequences conq^lementaty to the DMErencoding sequences, or any paits theieof, are used to 
detect, decrease, or inhibit expression of naturally occurring DME. Although use of oligonucleotides 
conpismg from about 15 to 30 base pairs is described, essentially the same procedure is used with 
5 smaller or with larger sequ^ceftagments. Appropriate oligonucleotides are designed using OUGO 
4.06 software (National Biosciences) and the coding sequence of DME. To inhibit transcription, a 
conq>lementary oligonucleotide is designed from the most unique S' sequence and used to prevent 
promoter binding to the coding sequence. To inhibit translation, a complemeatary oligonucleotide is 
designed to prevent libosomal binding to the DME-encoding transcript 

10 Xn. Egression of DME 

Expression and purification of DME is achieved using bacterial or vims-based e3q>ression 
systems. For expressiou of DME in bacteria, cDNA is subcloned into an appropriate vector 
containing an antibiotic resistance g^e and an inducible promote tbat directs hig|i levels of cDNA 
transcription. Exanq>les of such promote include, but are not limited to, the trp-lac {fac) hybrid 

15 pronK>ter and the T5 or T7 bacteriophage proinoter in conjunction with ttielaco^ 
element Recombinant vectors are transformed mto suitable bacterial hosts, e.g^ 
Antibiotic resistant bacteria express DME upon induction with isoprcpyl beta-D- 
thiogalactopyranoside (IPTG). Expression of DME in eukaryotic cells is achieved by mfecting insect 
or mammalian cell lines with recombinant Autopraphica califomica nuclear polyhedrosis virus 

20 (AcMNPV), commonly known as baculovirus. .The nonessential polyhedrin gene of baculovirus is 
replaced with cDNA encoding DME by either homologous recombination or bactmal-mediated 
transposition involving transfer plasmid intermediates. Viral infectiyity is maintained and the strong 
polyhedrin promoter drives hi£^ levels of cDNA tran5Ciq>tion. Recombinant baculovirus is used to 
infect Spodoptera fmgipeaila (Sfi9) insect cells in most cases, or human hqiatocytes, in some cases. 

25 lofectionof the latter requires additional geiieticinodifications to baculovirus. (See Engelhard, B-EL 
et al. (1994) Proc. Natl. Acad. ScL USA 91:3224-3227; Sandig, V, ^ al. (1996) Hum. Gme Hier. 
7:1937-1945.) 

hi most e;qiressi(Hi systems, DME is synthesized as a fusicm protdn with, eg., glutathioiie S- 
transferase (GST) or a peptide epitope tag, such as FLAG or 6-Ifis, p^mtting rapid, singlenstep, 

30 afSnity-based purification ofreconibinant fusion protein from crude ceUlysates^ GST, a 26- 
kilodalton enzyme firom Schistosoma iaponicum. enables the purification of fusion proteins on 
immobilized glutathione under conditions that fpaintam protein activity and antig^city (Amersham 
Pharmacia Biotech). Following purification, the GST moiety can be proteolytically cleaved fit)m 
DME at specifically engiaeered sites. FLAG, an 8-amino acid peptide, enables imnmnoaffioity 

35 purification using commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman 
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Kodak). 6-His, a stretch of six cooseciitive histidine residues, enables purificatton on mctal-chelate 
resins (QIAGEN). Methods for protein expression and pnrification are discussed in Ausubel (1995, 
supra, ch. 10 and 16). PorifiedDME obtained by these methods can be used directly in the assays 
shown in Examples XVI, XVII, and XVm, where applicable. 
5 Xm. Functional Assays 

DME function is assessed by cTqpresinng the sequences encoding DME at physiological]^ 
elevated levels in mammalian cell culture systems. cDNA is subcloned into a mammalian eTq^xression 
vector omtaining a strong {nromoter tibat drives high levels of cDNA e^qnession. Vectc»rs of choice 
include FCMV SPORT (Life Technologies) and FCR3.1 (Ihvitrogen, Carlsbad CA), both of whidi 

10 contain the cytomegalovirus promoter. 5-10 fig of recombinant vector are transienily transfected into 
a human cell line, for exam|>le, an endothelial or hematopoi^c cell line, using eith^ liposome 
formulations or electroporation. 1-2 /zg of an additional plasnud containing sequences encoding a 
marker protein are co-transfected. E}q9ression of a marloerpiotein provide 
transfeded cells firomnontransfected cells and is areliable predictor of cDNA expression from the 

IS reconibmant vector. Marker proteins of choice include, e.g.. Green Huorescrat Protein (GFP; 

Qontech), CD64, or a CD64-GFP fusion protein. Flow cytometry (PCM), an automated, laser optics- 
based technique, is used to identify transfected cells expres sing GFP <^ CD64-GFP and to evaluate 
the sqxyptotic state of the cells and other cellular properties. FCM detects and quantifies the iq>take of 
fluorescent mdeculestiiat diagnose events precedii^ or coincident with ce^ Theseevents 

20 include changes in nuclear DNA content as measnied by staining of DNA with propidimn iodide; 
changes in ceD size and granularity as measured by forward light scatter and 90 degree side li£^ 
scatter; down-iegulation of DNA synthesis as measured by decrease in faromodeoj^undine iqytake; 
alterations in expies^on of ceU sur£Eu:e and intracellular proteins as n^ 
q»ecific antibodies; and alterations in plasma mem^^ 

25 fluorescein-conj ugated Annexin V |»rotein to the cell surEace. Methods in flow cytometry are 
discussed in Qimerod, M.G. (1994) Plow Cytometry, Oxfonl, New YodcNY. 

The inflnenre of DME choi ggne expression can be assessed using hig|ily pnrifed populations 
of cells transfected witii sequoices encoding DME and either CD64 or CD64-GFP. CD64andCD64> 
GFP are ejqpressed on the surface of transfected cells and bind to conserved regions of human 

30 innnunoglobulin G (IgG). Transfected cells are efficiendy separated from nontransfected cells using 
magnetic beads coated with either human IgG or antibody against CD64 (D YNAL, Lake Success 
NY). mRNA can be pmifiedfirom the cells using methods weUlmown by those of skiU in th^ 
E^[iression of mRNA encoding DME and other genes of interest can be analyzed by northern analysis 
or microairay techniques. 

35 XIV. Prodncti nof DME Specific Antibodies 

99 



wo 02/46426 



PCT/USOl/47429 



DME substantially purified using polyacrylannde gel electrophoresis (PAGE; see, e.g., 
Harrington, M.G. (1990) Methods Enzymol. 182:488-493), or other purification techniques, is used to 
immunize rabtnts and to produce antibodies using standard protocols. 

Alternatively, the DME amino acid sequ^ice is analyzed using LASERGENE software 
5 CDNASTAR) to determine regions of high immuno^nicity, and a corresponding oligopeptide is 
synthesized and used to raise antibodies by means known to those of skill in the art Methods for 
selection of appropriate qpitopes, such as those near the C-terminus or in hydn^hific re^ons are well 
described in the art (See, e.g., Ausubel, 1995, supra, ch. 11.) 

Typically, oligopeptides of about 15 residues in length are synthesized using an ABI 43 lA 
10 peptide synthesizer (AppUed Biosystenis) using FMCX: chemistry and coupled to EIJI(Sigm^ 

Aldrich, St Louis MO) by reaction with NHmaleunidobenzoyl-N-hydro^sucdmmide ester (MBS) to 
increase immunogenicity. (See, e.g., Ausubel, 1995, supra .) Rabbits are immunized with the 
oligopeptide-KLH CQn:q>lex in complete Freund's adjuvant Resulting antism are tested for 
antipqitide and anti-DME activity by, for exaniple, binding tiie pqptide or DME t 
15 blocking with 1% BSA, reacting with rabbit antisera, washing, and reacting with radio-iodinated goat 
anti-rabbit IgG. 

XV. Purification of Natarally Occnrrii^ DME Usmg Specific Antibodies 

Naturally occurring or recombinant DME is substantially purified by inmumoafBnity 

chromatography using antibodies specific for DME. An inmaunoafBnity column is constructed by 
20 covalentiy coupling anti-DME antibody to an activated chromatographic resin, such as 

CNBr-activated SEnSAROSE (Amersham Pharmacia Biotech). Aftor the coupling, the resin is 

blocked and washed according to the manufacturer's instructions. 

Media containing DME are passed over the immunoafBnity column, ami the column is 

washed under conditions tiiat allow the preferential absorbance of DME (e.g., high icmic strraigtii 
25 buffers in the presraice of deterg^t). The column is eluted under conditions that disrupt 

antibody/DME Unding (e.g., a buffer of pH 2 to pH 3, or a hig^ concentration of a chaotrope, such as 

urea or thiocyanate icm), and DME is coHectBd. 

XVL Identification of Molecnks micfa Interact ^nOi DME 

DME, or biologically active fiagments tiiofeof, are labeled with BoltODhEbmter reagoit 
30 (See. e.g., Bolton, AR and WAl. Hunter (1973) Biochem. J. 133:529-539.) Candidate molecules 

previously arrayed in the weUs of a multi-well plate are incubated with the labeled DME, washed, and 

any wells with labeled DME conq)Iex are assayed. Data obtained using different concentrations of 

DME are used to calculate values for the number, afBnity, and assodaticm of DME with the candidate 

molecules. 

35 Alternatively, molecules interacting witii DME are analyzed using the yeast two-hybrid 
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system as described in Fields, S. and O. Song (1989) Nature 340:243-246, or using conuneiriaUy 
available kits based m the two-hybrid system, such as the MATCHMAKER system (QoDtech). 

DMEmay also be used in the PATHCAIJLINO process (CiiraGen Coip., New Haven CT) 
which enq)loys the yeast two-hybrid system in a high-tbioughput manner to determine all interactions 

5 between the protdns encoded by two large libraries of gmes (Nandabalan, EL et al. (2000) U.S. 
Patent No. 6,057,101). 
XVn. Demonstration of DME Activity 

Cytochrome P450 activity of DME is measured using the ^hydroxylation of aniline. Aniline 
is converted to 4-aimnophenol by the enzyme, and has an absorption tnaximnm at 630 nm (Gibson 

10 and Skett, supra) . This assay is a conv^ent measure, but und^estimates the total hydroxylatio^ 
which also occurs at the 2- and 3- positions. Assays are performed at 37 ^'C and contain an aliquot of 
the enzyme and a suitable amount of aniline (qyproximately 2 mM) in reaction buffer. F6r this 
reaction, Ac buffer inustcontamNADFH or an NADFH-generating One 
formulation f<x this reaction bufii^ includes 85 mM Tris pH 7 .4, IS mM MgCl2> 50 niM 

15 nicotinamide, 40 mg trisodium isocitrate, and 2 units isodtrate dehydrogenase, with 8 mg NADP*^ 
added to a 10 mL reaction buffer stockjust prior to assay. Reactions are carried out in an optical 
cuvette, and the absorbance at 630 funis measured. The rate of increase in absorbance is proportional 
to the en^me activity in the assay. A standard curve can be constructed using known concentrations 
of 4rammoph^ol. 

20 la,2S-dihydroxyvitamin D 24-hydroxylase activity of DME is detenmned by monitoring the 

convosion of ^I-labeled la,25-dihydroxyvitamm D (la,25(OH)jD) to 24,25-dihydroxyvitamm D 
(24,2S(OH)2D) m transgenic rats expressing DME. 1 fig of la,2S(OH)2D dissolved in ^rnnol (or 
ethanol alone as a contn^ is adnunistexed intravenoosty to approximately 6-week-dd male 
transgenic rats eaqxressing DME or odi^wise identical control rats esqxressing either a defective 

25 variant of DME or not e^qoessing DME. The rats are killed by deoqpitation after 8 hrs, and the 

kidneys are rapidly removed, rinsed, and iKmiogenized in 9 vohnones of ic&cold buffer ( 15 mM Tris- 
acfftatPr (pH 74), 0 1 Q M smrippgfi, 9. mM fn8|gnfyiiim acelata, and 5 mM sodium gmccnialE). Apoidon 
(e.g., 3 ml) of each homoeKiate is then incubated mth 0.2S nM la,25(OH>2[l-^^Hp>, widi a spedfic 
activity of approximately 3.5 GBq/mmol, for 15 mm at 37 under oxygen with constant shaking. 

30 Total liidds aro extracted as desoibed (Bligh, E.G. and W J. Dyer (1959) Can. J. Biochem. Physiol. 
37: 911-917) and the chloroform phase is analyzed by HPLC using a FD^AK STL column (JASOO, 
Tolyo, Japan) with an n-hexane/chloroform^tfaanol (10:2.5: 1 5) solvent system at a flow rate of 1 
ml/mirL hi the alteriiative, the cUoroform phase is andyzed by reverse phase HPIX^usii^ 
SPHERE QDS-AM cohmm (YMC Co. Ltd. , Kyoto, J^an) with an acetoiiitrile buffer system (40 to 

35 100%,in water,in30inin)ataflowrateof Ind/miii. The eluates are collected in fractions of 30 
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secoods (or less) and the amount of % present in each fraction is measured using a scintillation 
counter. By comparing the cbromatQgrams of control sanqples (Le., samples comprising 
la^-dihydioxyvitamin D or 24,2S-dihydn>xyvitamin D (24^(OH)2D)» with the cbzomatqgrams of 
the reaction products, the relative mobilities of the substrate (la»2S(OI])2[l-%QD) and product 
5 (24,2S(OH)2[l-^%[|D) are detemuned and correlated m The amount of 

24^(0£I^[1-^H]D produced in control rats is subtracted from that of transgenic rats expressmg 
DME. The difference in the production of 24^(OH)2[l-^H|D in the transgenic and control animals 
is proportional to the amount of 25-hydrolase activi^ of DME i^esent in the sanyle. Confirmation of 
the identity of the substrate andpn)duct(s) is confirmed by means of mass spectroso^y (Miyamoto, 

10 Y- et aL (1997) J. BioL Chem. 272:14115-14119). 

Flavin-containing monooTcygenase activity of DMB is measured by chromatographic analysis 
of metabolic products. For example. Ring, B J. et aL (1999; Drug Metab. Dis. 27: 1099-1 103) 
incubated FMO in 0. 1 M sodium phosphate buffer (pH 7.4 or 8. 3) and 1 mM NADPH at 37 
stopped the reaction with an organic solvent, and deterinined product foonation fay HPLC 

15 Altmiatively, activity is measured by inonitoringoxygpn uptake using a Qa^^ For 
example, Segler, DJ4. and Poulsm, LX. (1978; Methods Kizymol. 52:142-151) incubated the 
enzyme at 37 C in an NADPH-gpnerating cofactor system (similar to the one described above) 
ccHitaining the substrate niethimazole. The rate ofoxygen uptake is prc^Kirtional to enzyme activity. 
UDP glucuronyltransferase activity of DME is measured using a colorimetric determination 

20 of fiee amine groups (Gibson and Skett supra) . An amine-containing substrate, such as 2- 

aminophenol, is incubated at 37*'C witih an aliquot of die enzyme in a reaction buffer containing the 
necessary co&ctors (40 mM Tris pH 8.0, 7.5 mM MgP,, 0.025% Tiiton X-100, 1 mM ascorbic acid, 
0.75 mMUDP-glucuronic acid). After sufBcienttiine, the leacticn is stopped by addition of ic&rald 
20% trichloroacetic add in 0. 1 M phosphate haSSar pH Z7, incubated on ice, and cratrifuged to 

25 clarify tiie supernatant Any unreacted 2'^minophenol is destroyed in tins stq^. SuCBcientfireshly- 
I»nq>ared sodium nitrite is tiien added; tins step allows f onnation of tibe diazonium salt of the 
glucuronidated {MTodnct Excess nitrite is removed by addition of sufBcieotanmiooium^^ 
the diazonium salt is reacted with an animatic aiinne (for exainple, 

produce a colored azo compound which can be assayed spectrc^hotometricaDy (at' 540 mn, for 
30 example). A standard curve can be constructed using Imown concentrations of aiuline,wM 

form a chromophore with similar properties to 2-anunophenol glucuronide. 

Glutathione S-transferase activity of DME is measured using a model substrate, such as 2,4- 

dinitro-l-chlorob^izene, which reacts with glutathione to form a product, 2,4-dimtropt^nyl- 

glutatMone,fiiathasanabsorbanceri)axiinnmat340nm. It is important to note that GSTs have 
35 differing substrate specifidties, and the model substrate should be selected based on the substrate 



102 



wo 02/46426 



PCT/USOl/47429 



preferences of the GST of intexest Assays are peifanned at anibient tenq)^^ 
aUquot of tbe«azyin& in a suitable leaction buffer (^^ 1 mM glutathione, IniM 

dinitiochloFobraizene, 90 mM potassium phosphate buffer pH 6.S). Reactions are carded out in an 
optical cuvette, and the absoifoance at 340 nm is noeasured. The rate of increase inabsorbanceis 

5 proportional to the enzyme activity in die assay. 

N-acyltransferase activity of DMB is measured using radiolabeled amino acid substrates and 
measuring radiolabel incorporation into conjugated products. Bnzyme is incubated in a reaction 
buffer containing an unlabeled acyl'-CoA conqiound and radiolabeled amino add, and the 
radiolabeled acyl-conjugates are s&psaaAed from Ibe unreacted amino add by extraction into n-butanol 

10 or otb^ appropriate organic solvent For exan^Ie, Johnson, M. R et aL (1990; J. BioL Chem. 

266:10227-10233) measured bile add-CoAranuno acid N-acyltransferase activity by incubating the 
enzyme with cholyl-CoA and ^[-glycine or ^-taurine, separating the tritiated cholate conjugate by 
extraction into n-butanol, and measuring the radioactivity in the extracted product by scintillation. 
Alternatively, N-acyltransf (^rase activity is measured using die spectrophotometric determination of 

15 reduced CoA(CoASH) described below. 

N-acetyltransfeiase activity of DME is measured using the transfer of radiolabel fiom 
[^^acetyl-CoA to a substrate molecule (for exanq>le, see Deguchi, T. (197S) J. Neurochem. 
24:1083-5). Alternatively, a spectrophotometric assay based on DTNB (S,5*-ditfaio-bis(2rnitrobenzoic 
add; Elhnan'sreag^t)reacti(ni with CoASHinay be used. Free thiol-containingCoASH is formed 

20 during N-^u^etyltransferasecatdyzed transfer ofan acetyl groiq> to a su CoASH is detected 

using the absorbance of DTNB conjugate at 412 nm (De AngeUs, I. et al. (1997) J. Biol. Chem. 
273:3045-3050), Enzyme activity is proportional to the rate of radioactivity incorporation into • 
substrate, or die rate of absorbance increase in the spectrophotom^c assay. 

Protein argjlnihemethyltransfetase activity of DME is measured at ""C for various periods 

25 of time. S-adenosyl-Mmethyl-^HJmetMwiine ([^ AdoMet; s^ 

Life Science Products) is used as the methyl-donor substrate. Useful methyl-accepting substrates 
include gtutathione S-transferase fibrillarin gtydne-argfaiine domain fusion protein (GST-GAR), 
homogeneous nuclear ribonucleoprotein (bnRNP), cxr hypome^ylated proteins present in lysates 
from adenosine dialdehyde-treated cells. Methylation reactions are stc^iped by adding SDS-PAGE 

30 sauq)le buffer. The products of the reactions are resolved by SDS-PAGE and visualized by 

fluorography. The presence of ^H4abeled methyl-donor substrates is indicative of protein argmine 
metiiyltransferase activity of DME (Tang, J. et aL (2000) J. Biol. CSiem. 275:7723-7730 and Tang, J. 
et al. (2000) J. Biol. (3iemu275: 19866-19876). 

Catechol-O-methyltransferase activity of DME is measured in a reaction mixture consisting 

35 of 50 mM Tris-HQ (pH 7.4), 1.2 mM MgQj, 200 /iM SAM (S-adenosyl-Lrmetiiionine) iodide 
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(contafauBg 0.5 of iDetfayl-[lP]SAM), 1 mM dithiothreitol, and varying concentratiQns of catechol 
substrate (e.g., L-dopa, dopamine, or DBA) in a final vohune of 1.0 nd. The leaction is mitiated by 
the addition of 250-500 fig of purified DMB or crude DME-containing san^le and perf onned at 37 
"CforSOmin. The reaction is arrested by rapidly cooling on ice and inmiediatefy extract^ 
5 ml of ice-cold n-heptane. Fonowingcentrifugationatl000xgforl0min,3-inlaliqaotsofthe 
organic extracts are analyzed for radioactivity content by liquid scintillation counting. The levd of 
catechol-associated radioactivity in the organic phase is proportional to the catechol-0- 
mediyltransf(»case activity of DME CZhu, B. T. and J. G. Liehr (1996) 271:1357-1363). 

DHFR activity of DME is determined spectrophotometricaUy at 15 *'C by following the 

10 disappearance of NADFH at 340 xosk (£340= 11*800 M'^'cm'O- The standard assay mixture contains 
100 iM NADFH, 14 mM 2-m»captoethanol, MIEN buffer (50 mM 2>moipholinoedianesulfonic 
add, 25 mM tris(hydn>xymetfayl)aminoniethane, 25 mM ethanolamine, and 100 mM NaQ, pH 7.0), 
and DMB in a final volume of 2.0 ml. The reaction is started by the addition of 50 fiM dihydrofolate 
(as substrate). The oxidation of NADFH to NADF corresponds to the reduction of dihydrofolate in 

15 the reaction and is proportional to die amount of DHFR activity in the sanq>le (Nakamura, T. and 
Iwakura. M. (1999) J. Biol. CSiem. 274:19041-19047). 

Aldo/teto reductase activity of DME is measured using the decrease 
as NADFH is consumed. A standard reaction mixture is 135 nM sodium phosphate 
7. 2 dqpendi^g on enzyme), 0.2 mM NADFH, 03 M litiiium sulfate, 0.5-2.5 mg &oxym and an 

20 appropriate level of substrate. The reaction is incubated at 30''C and tiie reaction is monitOTBd 

continuously with a spectrophotometer. Enzyme activity is calculated as mol NADFH consumed /mg 
of ens^yme. 

Alcohol dehydrograiase activity of DME is noeasured using the increase in absorbance at 340 
nm as NAD*^ is reduced to NADH. A standard reaction mixture is 50 mM sodium phosphate, pH 73, 
25 and 0.25 mMEDTA. The reaction is incubated at 25''C and monitored usmg a spectcqpho^ 
Qozyme activity is calculated as mol NADH produced / n^ of ^izyme^ 

Carbos^lesteiase activity of DME is detennined using 4Haai^yluihbellifiBiyl acetate as a 
substrate. The enzymatic reaction is initiated fay addir^g appioxim ately lOftlof DMB^ontaining 
5anQ>le to 1 nil of reacticm buffer (90 nM KH2PO4, 40 iriM KQ, pH 7.3) with 0.5 niM 
30 4-niBthylumbellifecyl acetate at 37''C. The production of4-methylumbelMerone is inoxutored with a 
spectn>photc»netQr (8330 = 12.2 mM"* cm*0 for 1.5 min. Specific activity is expressed as micromoles 
of product formed per minute par milligram of protein and corresponds to the activity of DMB in the 
san^le OBvgenia, V. et aL (1997) J. BioL ChraL 272:14769-14775). 

In die alternative, tt^ cocaine benzoyl est^ hydrolase activity of DMB is measured by 
35 incubating {proximately 0. 1 ml of DME and 3 .3 mM cocaine in reaction buffer (50 mM NaH2P04, 
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pH74)withlnMbenzaimdine, lnMEI>rA»andlih^ Theieacticmis 
mcubated for 1 h in a total volume of 0.4 ml then tenninated with an equal volume of 5% 
trichloroacetic acid. 0.1 ml of the internal standard 3»44imethylbeQzoic acid (10 iigfnA) is added. 
Piecipitated protein is s^aiated by centrifugation at 12,000 x g for 10 min. The supernatant is 
5 transfened to a clean tube and extracted twice with 0.4 ml of mediylenecUori^^ The two extracts 
aie combined and dried undo: a stream of nitrogen. The residue is resuspended in 14% acetonitrile, 
250 mM KH2P04» pH 4.0, with 8 ^1 of dietfaylamine per 100 ml and injected onto a CIS reverse- 
pliaseHPLCcolunm for separation. The column eluate is monitored at 235 ma DME activity is 
quantified by comparing peak area ratios oftheanalyte to the intenial standard. A standard curve is 

10 genearated with benzoic add standards prq»aied in a trichloroacetic add-treated protdn matrix 
(Evgenia, V. et al. (1997) J. Biol. Chem. 272:14769-14775). 

In another alternative, DME carboxyl esterase activity against the water-soluble substrate 
para-nitrophenyl butyric add is determined 1^ spectrophotometric methods well known to those 
skilled in the art Id this procedure, the DME^ontaining samples are diluted with 0.5 M Tris-HQ 

15 (pH7.4Qr8.0)or sodium acetate (pH 5.0) in the presence of 6 niMtaurocholate. Theassayis 
mitiated by adding a fiesUy prepared para-nitrophenyl butyric add solution (1(X) fig^nd in sodium 
acetate, pH 5.0). Carboxyl esterase activity is then maoitoied and conqiared with control 
autohydrolysis of the substrate using a spectrophotometer set at 405 nm (Wan, L. et aL (2000) J. Biol. 
Chem. 275:10041-10046). 

20 SutTotransf^case activity of DME is measured using the incorporation of fiom [^S]PAPS 

into a model substrate such as phenol (Folds, A. and J. L. Meek (1973) Biochim. Biophys. Acta 
327:365-374). An aliquot of enzyme is incubated at 37 X with 1 mL of 10 nM phosphate buffer, pH 
6.4, SO mM i^nol, and 0.44.0 mM t^S] admosine 3'-phosplmte S'-phosphosnlfale (PAPS). Aft^ 
sufBd^ time f or 5-20% of ttie radiolabel to be tnmsferred to tibe 

25 acetate is added to predpitate protein and phosphate bufifer. Then 0.2 mLof0.1MBa(OH)2 is added, 
followed by 0.2 mLZnS04. The supeniatant is cleared by centrifugation, which xomovespr^ 
well as unreacted [^SJPAPS. Radioactivity in the supernatant is measured by sdntfllati<a. The 
enzyme activity is detminned from the number of moles of radioactivity in the reaction product 
Heparan sul&te 6-5uIfotransferase activity ofDME is measured in vitro bv incubating a 

30 sanq)le containing DME alcmg with 2.5 fimol imidazole HQ (pH 6.8), 3.75 pg of protamine chloride, 
25 nmol (as hexosamme) of completely desulfiited and N-^esulfated hq[>arin, and 50 pmol (about 5 x 
lO' cpm) of l^S]adeDOsine S'-phosphate 5'-pho5phosuI&te (PAPS) in a final reaction volume of 50 
ftlat37Xfor20min. The reaction is stq[»ped by immersing the reaction tubes in a boil^ 
badiforlmin. 0.1 ^mol (as ghicuranic add) ofchondroitinsnl£ate A is added to the reaction 

35 mixture as a carrier. ^S-kbeledpoIysaccharidesarepredpitated witii 3 volumes of cold ethanol 
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contaimng 1.3% potassium acetate and separated conqpletely frcHn muncoECporated f^S]PAPS and its 
degradation products by g^l chromatography using desalting columns. One unit of enzyme activity is 
defined as the amount required to transfer 1 pmolofsul£ate/min.»detenmned by the amount of 
[^%]PAPS incorporated into the precipitated polysaccharides (Habucbi, R et al. (1995) J. Biol. 

5 Chem. 270:41724179). 

Jn the alternative, heparan sulfate 6-sulf otransf^rase activity of DMB is measured by 
extraction and renaturation of enzyme from gels following separation by sodium dodecyl sulfate 
polyacrylamide gel electrophoresis (SDS-PAGE). Followii^ separatiosi, the gel is washed with 
bu£G^ (0.05 M Tris-HQ, pH 8.0), cut into 3-5 mm segments and subjected to agitation at 4 with 

10 100 fil of the same buffer containing 0.15 MNaQ for 48 h. The eluted enzyme is collected by 
centrifugation and assayed fox the sulfotransferase activity as described above (Habuchi, H. et aL 
(1995) J. Biol. Ghent 270:4172-4179). 

hi anotiier alternative, DMB sulfotransferase activity is det^cmined by measuring the transfix 
of [^S]su]fate from [^S]PAPS to an inmiobilized peptide that repre^ 15 residues 

15 of the mature P-sdectin glycoprotein ligand-1 polypeptide to which a C-terminal cysteme residue is 
added. The peptide spans three potential tyrosine sulfation sites. The peptide is linked via the 
cysteine residue to iodoacetamide-activated resin at a density of 1.5-3.0 funol peptideAool of resin. 
The enzyme assay is performed by conihining 10 fil of p^tide-derivitized beads with 2-20 ^1 of 
DM&containing sample in 40 inM Pipes (pH 6.8), 03 M NaCl, 20 mM MnQj, 50 mM NaF, 1% 

20 Triton X-100, and lmM5'-AMP in a final volume of 130 fil. Theassay is initiated by addition of 0.5 
lid of [^SjPAPS (1.7 (iM; 1 a = 37 GBq). Aftor 30 mm at 37**C tiie reaction beads are washed 
with 6 M guanidine at 65''C and the radioactivity incorporated into the beads is detmnined by liquid 
scintillation countu^ Transfo of f^jsol&te to the bead-associated pqrtide is measured to 
detennine the DME activity in the saixq[>le. Qneunitof activity is defined as Ipmol of product 

25 f<»3med per iinn(Ouyang,Y.-B.etal. (1998) Biochmustry 95:2896-29^ 

hi another alternative, DME sulfotransferase assays are pexf oacmed using [^SIPAPS as the 
sulfate donor in a fimal volume of 30 containing 50 mM Hqies-NaOH (pH 7 .0), 250 mM sucrose, 
1 mMdiduotineitol, 14 iM^^SJpAPS (15 CSAmmol), anddopainine(25 f(M),/^-ni^ (5 fiM), 
or othor candidate substrates. Assay reactions are started by the addition of a purified DME enzyme 

30 preparation or a san^le containing DME activity, allowed to proceed for 15 min at 37^C, and 

terminated by heating at lOO^'C for 3 irun. The precipitates formed are cleared by centrifiigation. The 
supematants are then subjected to the analysis of ^S-suUated product by either thin-layer 
chromatography or a two-dimensional thin layer separation procedure. Appropriate standards are run 
in parallel witii the supematants to allow the identification of the ^S-suIfated products and determine 

35 the enzyme specificity of the DME-containing sanqples based on relative rates of migration of 
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reaction products (Sakakibara, Y. et al. (1998) J. Biol. Oiem. 273:6242^247). 

Sqoaloie qpoxidase activity of DME is assayed in a mixture conq>risiiig purified DME (or a 
crude mixture comprising DME), 20 mM Tris-HQ (pH IS), 0.01 nM FAD, 0^ unit of 
NADPH-cytochron» C (P-450) reductase, 0.01 mM ["Qsqualene (dispersed with the aid of 20 fil of 

5 Tween 80), and 0.2% Triton X-100. 1 mMNADFH is added to initiate the reaction followed by 
incubation at 37 ""C for 30 min. The nonsaponifuible lipids are analyzed by silica gel TLXi: develop 
with efhyl acetate/benzene (0.5:99.5, v/v). The reaction products are compared to those from a 
reaction mixture without DMR Thepresenceof 23(5)-oxidosqual^ is confirmed using 
appropriate lipid standards (Sakakibara, J. et aL (1995) 270:17-20). 

10 ]^xide hydrolase activity of DME is detendmed by foUowiiig substrate depletio 

chromatognvhic (GQ analysis of ethereal extracts or by following substrate depletion and diol 
production by (K] analysis of reaction inixtiiresqumched in acetra A sample containing DME or 
an epoxide hydrolase control sample is incubated in 10 mM Tris-HQ (pH 8.0), 1 mM 
ethylenediammetetraacetate (EDTA), and 5 mM epoxide substrate (e.g., ethylene oxide, styzene 

15 oxide, propylene oxide, isoprene monoxide, epichloiohydtin, qxibromohydrin, epifluorohydrin, 
gilyddd, 1,2-epoxybutane, 1,2-epoxyhexane, or 1,2-epoxyoctane). A portion of the sanq>le is 
withdrawn from the reaction mixture at various time points, and added to 1 ml of ice-cold acetone 
containing an intenial standard for GC analysis (e.g., 1-xioaanoI). Protein and salts are removed by 
CCTtdfugation (15 min, 4(X)0 X g) and the extract is analyzed by GC usm 

20 CP-Wax57-GB column (CHROMPACK, Middelburg, The Netherlands) and a flame-icmization 
detector. The identification of GC products is perfonned using appropriate st^ 
well known to those skilled in the art 1 unit of DME activity is defined as the amount of enzyme that 
catalyzes the productim of 1 funolof diol/mmC[Unk,ILetaL (1997) J.BioL C3^^ 
272:14650-14657). 

25 Aminotransferase activity of DME is assayed by incubating samples containing DME for 1 

hour at 37°C in tiie presence of 1 iiM I^Jcynurenine an^ 

200 fcl of 150 ndlfTris acetate buffer (pH 8.0) coDtai^^ The formation of kynuremc 

and is quantified by HPIX; with spectrophotoDKtric d^BC^ 

standards and controls weD known to those skilled in the art hi the alternative, 

30 Lr3-hydroxykynurenine is used as substrate and the production of xanthurenic acid is determined by 
HPLC analysis of the products with UV detection at 340 mn. The production of kynurenic acid and 
xanthurenic acid, respectively, is indicative of anunotransferase activity (BucUi, R. et aL (1995) J. 
BioL Ctem. 270:29330^29335). 

In another altranative, aminotransferase activity of DME is measured by detetnmiing the 

35 activity of purified DME or crude samples containing DME toward various andno and oxo acid 
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substrates under single tumov^ conditions by monitoring the chan^ in tbe UV/VIS absorption 
spectrum of the enzynie4x)und cofactor, pyridoxal S -phosphate (PIP). The reactions are performed 
at 2^0 in SO nM 4-methyhnorpholine (pH 7.5) containing 9 fiM puiiiBed DME or DME containing 
sanq>les and substrate to be tested (ammo and oxo acid substrates). The lialf-reaction from ammo 

5 acid to oxo acid is followed by measuring the decrease in absoifoance at 3 60 nm and the increase m 
absorbance at 330 nm due to the conv^on of enzyme-bound FLP to pyridoxamine 5* phosphate 
(EMP). The qjedfidty and relative activity of DMB is deterinmed by the activity of 
preparation against specific substrates (Vacca, ILA. et al. (1997) J. Biol. Cb&iSL 272:21932-21937). 
Siq)eroxide dismutase activity of DME is assayed from cell peDets, culture supematants, or 

10 purified protein piq^arations. Samples or lysates are resolved by electrophoresis on 15% 
non-denaturing polyacrylandde gels. The gels are incubated for 30 min in 2.5 mM nitto blue 
tetrazolium, followed by incubation for 20 min in 30 mM potassium phosphate, 30 iriM TEMED, and 
30 fiM riboflavin 7.8). Superoxide dismutase activity is visualized as white bands against a blue 
background, following illumination of the gels on a ligjbtbox. Quantitation of superoxide dismutase 

15 activity is performed by densitometric scannii^ of the activity gels using the appropriate superoxide 
dismutase positive and negative controls (e.g., various amounts of commercially available R coli 
siqi^xide dismutase (Bbrdi, G. and Horwitz, M. A. (1999) J. Biol. Qiem. 274:4281-4292). 
XVm. Identification of DME Lddbitors 

Compounds to be tested are arrayed in the wells of a mdti-weU plate in varying 

20 concentrations along with an apprc^riate buffer and substrate, as described in the assays m Example 
XVTL DME activity is measured for each well and the ability of each conqK)und to inhibit DME 
activity can be detennined, as well as the dose-iespoiise profiles. This assay could also be used to 
idmtify molecules whidi enhance DME activity. 

25 Various modifications and variations of the described methods and systems of the invention 

will be apparent to those skilled in the art without departing fi:omthe scope and spirit of the 
invratioiL Although die invention has been described in connection witii certain embodi^^ 
should be understood that th& invention as claimed should not be unduly limited to such specific 
embodiments. Indeed, various modifications of the described modes for carrying out the invention 

30 which are obvious to those skilled in molecular biology or related fields are intended to be within the 
scope of the following claims. 
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1. Ad isolated polypeptide selected firom the groiq) CO 

a) a polypeptide conqkiising an amino add sequence selected from the group consisting 
ofSEQIDNO:l-13, 

b) a polypeptide conqinsing a naturally occurring amino acid sequence at least 90% 
identical to an amino acid sequence selected from the group consisting of SEQ ID 
N0:l-4 and SEQ ID NO:6-13, 

c) a polypeptide compris in g a naturally occurring amino add sequence at least 99 % 
identical to the amino acid sequence of SEQ ID NO:5, 

d) a biologically active firagmirait of a polypeptide having an amino add sequmce 
selected from the group consisting of SEQ JD NO: 1-13, and 

e) an immunogenic fragment of a polypeptide having an amino add sequence selected 
fromtiie g^yup consisting of SEQ ID N0:l-13. 

2. An isolated polypeptide of claim 1 coiiqprising an ainino add seqi^^ 
group consisting of SEQ ID NO: 1-13. 

3. An isolated polynucleotide encoding a polypeptide of claim 1. 

4. An isolated polynucleotide encoding a polyp^tide of claim 2. 

5. An isolated polynucleotide of claim4 compdsiiig a polynucleotide sequosce selected from 
the group consisting of SEQ ID NO:14-26. 

6. A recombinant polynuclecrtide comprising a promoter seqnejir^ opaably linked to a 
polynucleotide of claim 3. 

7. A ceUtraiisformed with a recombinant polynudeotide of clai^ 

8. A transgenic organism conq)rising a recombinant polynucleotide of claim 6. 

9. A method of producing a polypeptide of claim 1, the method conqirising: 

a) culturing a ceU und^ conditions suitable for expression of the polypeptide, wherein 
said cell is transformed with a recombinant polynucleotide, and said recombinant 
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polynucleotide coiqpiises a ptomotso' sequence opeiably linked to a polynucleotide 
^coding the polypeptide of claim 1, and 
b) recovering the polypeptide so expressed. 

10. A method of claim 9, wherein the polypeptide con9>rises an amino acid sequence 
selected firom the group consisting of SEQ ID NO: 1-13. 

11. Anisolatedantibody wUchspecificaUy tmds to a polypeptide of claim 1^ 

12. An isolated polynucleotide selected from the group consisting o£ 

a) a polynucleotide corqprising a polynucleotide sequence selected from the group 
consisting of SEQ ID NO: 14-26, 

b) a polynucleotide ccHiqnising a naturally occurring polynucleotide sequence at least 
90% identical to a polynucleotide sequence selected firom the groiq> consisting of 
SEQ ID NO: 14-19, SEQ ID NO:20-26, . 

c) a polynucleotide comprising a naturally occuning polynucleotide sequence at least 
9S% identical to the polynucleotide sequraoce of SEQ ID NO:18, 

d) a polynucleotide conq>l»nentary to a polynucleotide of a), 

e) a polynucleotide complementary to a polynucleotide of b), 

f) a polynucleotide cooDoplementary to a polynucleotide of c), and 
an RNA equivalent of a>f). 

13. An isolated polynucleotide comprisii^ at least 60 contiguous nucl^^ 
polynucleotide of claim 12. 

having a sequence of a polynucleotide of claim 12, tiie m^xxl conqirising: 

a) faybridiznig the sample witfi a probe ccHiqxrising at least 20 ccnliguous nucleotides 
comprising a sequeaace cmnplementaiy to said taig^ polynucleotide in tiie san^le, 
and which probe specifically hybridizes to said target polynucleotide, und^ 
conditions whereby a hytffidization conq>lex is f onned betwera said probe and said 
target polynucleotide fragments thereof, and 

b) detecting the presence or abs»ce of said hybridization conqplex, and, optionally, iF 
present, the a iroinit thereof. 
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IS. A method of claim 14, wbecein the probe comprises at least 60 ccmtigaoos nucleotides. 



16. A method of detecting a taiget polynucleotide in a saii^le, said taigetp^^ 
having a seqpimce of a polynucleotide of claim 12, the method coursing: 

a) anq>lifying said target polynucleotide or fragment thereof using polymerase chain 
reaction anq^lificatioo, and 

b) detectmg the presence or absence of said amplified target polynucleotide or firagment 
thereof, and, optionally, if present, the amount thereof. 

17. A compositicHi ccnqirising a polypeptide of claim 1 and a pharmaceutically acceptable 
excipient 

18. A composition of claim 17, wherein the polypeptide coiqnises an amino add sequence 
selected from the group consisting of SEQ ID NO: 1-13. 

19. A method for treating a disease or conditicm associated with decreased egression of 
functional DMB, comprising administering to a patient in need of such treatmirait the composition of 
claim 17. 

20. Amethodof screening a compound for effectiveness as an agonist of a polyp 
claim 1, the inelfaod conqnisiDg: 

a) e7qx)sing a sample comprising a polypeptide of daim 1 to a compound, and 

b) detecting agonist activity in the sample. 

21. A cQiiq)ositian ccMmprising an agonist conq^^ 
pl^nnaceutically acceptable exdpient 

22. A inethod for treatiiig a disease or ccHiditicHi associated with decreased e]qness^ 
fimctional DME, coiiqirimig adiniiiistering to a patient i^ 

claim 21. 

23. Amethodof soDeening a conqpound for effectiveness as an antagonist of a polyi^ 
claim 1, die nsethod con^rising: 

a) exposing a sanq>le conqxrising a polypeptide of claim 1 to a conqxiund, and 

b) detecting antagonist activity in the sample. 
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24. A co!iiqx>sition conqnising an antagtnust ra^^ 
aad a phannaceutically acceptable excipient 

25. A method for treating a disease or condition associated wi 

DME, comprising admfaustering to a patient in need of such treatmmt a composition of claim 24 . 

26. Ameihodof scieeningforacoaq)Oundthatq)ecificaUybmdstothepol^ 
1» the method conqnising: 

a) combining the polypeptide of claim 1 with at least one test conQ>ound under suitable 
conditions, and 

b) detectXDg binding of the polypeptide of claim 1 to the test conqx>und, thereby 
identifying a conq>ound that specifically binds to the polypeptide of claim 1. 

27. Amethodof screening for a conqK)und that modulates the activity of the 
claim 1, the method conqnising: 

a) combining the polypeptide of claim 1 with at least one test c(»npound under 
conditions permissive for the activity of the polypeptide of claim 1» 

b) assessing the activity of the polypeptide of claim 1 in the presence of the test 
compound, and 

c) conqmiingtheactivity of the polypeptide of claim 1 in the presence of the tes^ 



28. Ametiiodof screeniiigaconq)OundforeffectiveDessinalt»i^ 
polynucleotide, ^^rfieiein said target polynucleotide comprises a sequence of claim S, die method 



CQiiq)ound with the activity of the polypeptide of claim 1 in the absence of die test 
conqK>und, whradn a change in the activity of the polypeptide of claim 1 in ti^ 
presence of the test conyound is indicative of a conyound that modnl ates the activity 
of the polypeptide of claim 1. 




exposing a sample comprising the target polynucleotide to a conqxmnd, muier 
conditions suitable for the e)q)ression of the target polynucleotide, 
detecting altered expression of the target polynucleotide, and 
conq>aring the expression of the target polynucleotide in the presence of varying 
amoimfg of the CQoqpound and in the absence of the compound. 



b) 
c) 



29. Ametbodof assessiiig toxicity of a test c(nxqK)und, the ineihodcoiiq>risin^ 
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treatmg a biological sample containing nucleic acids with the test compound, 
hybridizing the nucleic acids of the treated biological san^le with a probe comprising 
at least 20 contiguous nucleotides of a polynucleotide of claim 12 under conditions 
whereby a specific hybridization conq)lex is formed between said probe and a target 
polynucleotide in the biological sample, said target polynucleotide comprising a 
polynucleotide sequence of a polynucleotide of claim 12 or fiagment tfaaieof, 
quantifying the amount of hybridization conoplex, and 

con:q)aring the amount of hybridization con^lex in the treated biological san[q>le with 
the amoimt of hybridization con^lex in an untreated biological sample, wherein a 
difference in the amount of hybridization conq>lex in the treated biological sample is 
indicative of toxicity of the test conq>ound. 

30. A diagnostic test for a condition or disease associated with the expression of DME in a 
biological sample, the m&thod conqmsing: 

a) combining the biological sanq)le with an antibody of claim 1 1 , under conditions 
suitable for the antibody to bind the polypeptide and form an antibody:polypeptide 
con^lex, and 

b) detecting the conq)lex, wherein the presence of the con[5)lex correlates with the 
presence of the polypeptide in the biological san^le. 

31. The antibody of claim 1 1, wherein the antibody is: 

a) a chiiDimc antibody, 

b) a single chain antibody, 

c) aFab fragment, 

d) a F(ab')2 fragment, or 

e) a humanized antibody. 

32. AcQnqK>siti£mcompria]iganantibody of claimll 

33. A method of diagnosing a condition or disease associated with the ^qiression of 

a subject, compising administering to said subject an effective amount of the composition of claim 
32. 

34. A coiiqx)sition of claim 32, i^dierein the antibody is labeled. 

35. Ainethodof diagnosiiig a condition or disease associated with the expression of DM^ 
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b) 



c) 
d) 
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a subject, conq>rising adniMstermg to said subject an effective amount ot the conq)osition of claim 

34. 



36. A method of ]piepaiing a polyclonal antibody with the specificity of the antibody of claim 
5 11, the method comprising: 

a) * immuniong an animal with a polypeptide consisting of an amino acid sequence 

selected from the group consisting of SEQ ID NO: 1-13, or an immunogenic fragment 
thereof, under conditions to elicit an antibody response, 

b) isolating antibodies from said animal, and 

10 c) screening the isolated antibodies with the polypeptide, thereby id^tifying a 

polyclonal antibody which binds specifically to a polypeptide comprising an ammo 
acid sequence selected from the group consisting of SEQ ID NO: 1-13. 

37. A polyclonal antibody produced by a method of claim 36. 

15 

38. A com^sition conq>rising the polyclonal antibody of claim 37 and a suitable carrier. 



39. A method of making a monoclonal antibody with the specificity of the antibody of claim 
11, the method comprising: 

20 a) immunizing an animal with a polypeptide consisting of an amino add sequence 

selected from the group consisting of SEQ ID NO: 1-13, or an immunogenic fragment 
thereof, under conditions to elicit an antibody response, 

b) isolating antibody producing ceQs from the animal, 

c) fusing the antibody producing ceUs with immortalized cells to form monoclonal 
25 antibody-producing hybridoma cells, 

d) culturing the hybridoma cells, and 

e) isolating from the cultnieinonoclonal antibody wldch binds spedficaU 
polypqytide comprising an amino add sequence sdected from the gcmp consisting of 
SEQIDNO:M3. 

30 

40. A monoclonal antibody produced by a method of claim 39. 



41. A conqK)siti(ni comptismg the mcmoclonal antibody of claim 40 and a suitable carrier. 



35 



42. The antibody ofclahn 11, wheiem the antibody is produced by screening a Fab 
expression lihraiy. 
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43. The antnxMty of claim llyWhecein the antibody is prodi^ 
inmnmoglobiilin library. 



44. A loethod of detecting a polypeptide conq>rishig an amino acid sequence selected from 
5 the group consistmg of SEQ ED NO: 1-13 in a sanq>le, the method conqprising: 

a) incubating the antibody of claim 1 1 with a sanq)le under conditions to allow specific 
binding of the antibody and tb& polypeptide, and 

b) detecting specific binding, wherein specific binding indicates the presence of a 
polypeptide conq>rising an amino acid sequence selected bam the groiqi consisting of 

10 SEQ ID NO: 1-13 in the sanq>Ie. 

45. A method of purifying a polypeptide conq>rising an amino acid sequence selected from 
the group consisting of SEQ ID NO: 1-13 from a sample, the method conqnising: 

a) incubatiag the antibody of claim 1 1 with a sanq>le under conditions to allow specific 
15 broding of the antibody and the polypeptide, and 

b) separating the antibody fiiom the sanq>le and obtaining the purified polypeptide 
conqprising an amino acid sequence selected from the group consisting of SEQ ID 
N0:l-13. 

20 46. A nucroauay wherein at least one element of the microairay is a polynucleotide of claim 

13. 

47. Amsthodof generating an ^q;)!re5sion profile of a saiiQ>lewMchcQnta^ 
polynucleotides, the method conqnising: 

25 a) labeling the polynuclec^des of the san^le, 

b) • contactingtheelementsoftiieniicroarray of claim 46 with the labeled 

polynucleotides of die san^Ie under conditi(His suitable for the formation of a 
hybiidizatim complex, and 

c) quantifying the ejq[xression of the polynucleotides in the san^ 

30 

48. An array conq>rising different nucleotide nK^lecules affixed in distinct physical locations 
on a soUd substrate, wherein at least one of said nucleotide molecules comprises a first 
oligonucleotide or polynucleotide sequence specifically hybridizable with at least 30 contiguous 
nucleotides of a target polynucleotide, and wherein said target polynucleotide is a polynucleotide of 

35 claim 12. 
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49. An array of claim 48, whaein said first oligonucleotide or polynucleotide sequence is 
con^letely complementary to at least 30 contiguous nucleotides of said target polynucleotide. 



50. An array of claim 48, wh^nein said first oligonucleotide or polynucleotide sequence is 
5 conQ)letely conq>lementary to at least 60 contiguous nucleotides of said target polynucleotide. 

51. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 
completely complementary to said tar;^ polynucleotide. 

10 52. Anarray ofclaim48, whichisanoicroarray. 

53. An array of claim 48, further con^Mising said target polynucleotide hybridized to a 
nucleotide molecule conq)rising said first oligonucleotide or polynucleotide sequence. 

15 54. An array of claim 48, wherein a linl^r joins at least one of said nucleotide molecules to 

said solid substrate. 

55. An array of claim 48, wherein each distinct physical location on the substrate contains 
multiple nucleotide ]iK>]ecule5, and the multiple nucleotide molecules at any single distinct physical 

20 location have the same sequence, and each distinct physical location on the substrate c(xitains 

nucleotide molecules having a sequence which differs firom the sequence of nucleotide molecules at 
another distinct physical location on the substrate. 

56. A polypq>tide of claim l,con(qirising the amino acid sequence of SBQ ID NO: I. 

25 

57. A polypeptide of claim I, conqnising the anuno acid sequence of SEQ ID NO:2. 

58. A polypeptide of claim 1 , comprising the amino acid sequence of SEQ ID NChS. 
30 59. A polypeptide of claim 1, conymsing ttie amino acid sequence of SEQ ID NO:4. 

60. Apolypeptideof claim l,conq>rising the amno acid seqneiK:ecf SEQ n)N^^ 

61. Apolypq>tideof claim l,conqHising the amino add seqiiQDce of SEQ ID NO:^^ 

35 

62. A polypeptide of claim 1, conq>rising the amino acid sequence of SEQ ID NO:7. 
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63. A polypeptide of claim 1, conqndsing the amino add sequsoice of SEQ ID N0:8. 

64. A polypeptide of claim l,conqxrising the aintno add seq 

5 65. A polypeptide of claim 1, conq)risiiig the aniino add sequence of SEQ ID NO: 10. 

66. Apolypeptideof claim l,conq>rismg the amino add sequence of SEQ ID 

67. A polypeptide of claim 1, conq>rising the amino add sequence of SEQ ID NO: 12. 

10 

68. A polypeptide of claim 1, conoprising the amino add sequence of SEQ ID NO: 13. 

69. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:14. 

15 

70. A polynucleotide of claim 12, conq>rising the polynucleotide sequence of SEQ ID 

NO: 15. 

71. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

20 NO:16. 

72. A polynucleotide of claim 12, conqmsing the polynucleotide sequence of SEQ ID 

NO:17. 

25 73. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:18. 

74. A polynucleotide of daim 12, ccmgnising the polynucleotide seqneace of SEQ ID 

NO:19. 

30 

75. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:20. 

76. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

35 NO:21. 
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77. A polynucleotide of claim 12, comprisiiig the polynucleotide sequence of SEQ ID 

NO:22. 

78. A polynucleotide of claim 12, cooqirising the polynucleotide sequence of SEQ ID 

5 NO:23. 

79. A polynucleotide of claim 12, conqiri^ng the polynucleotide sequence of SEQ ID 

NO:24. 

10 80. Apolynucleotideof claim 12, coniprisuigtbB polynucleotide sequence of SEQ ID 

NO:25. 

81. A polynucleotide of claim 12, comprising tbe polynucleotide sequence of SEQ ID 

NO:26. 
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<110> INCYTE GENOMICS, INC. 
SANJANWAliA, Madhu M. 
YAO, Honigue G. 
AU- YOUNG, Janice 
BAUGHN, Mariah R. 
ARVIZ0, Chandra 
RING, Uuijim Z. 
LEE, Ernestine A. 
DING, Li 

HAFALIA, April J. A. 
TANG, y. Tom 
HOE, Henry 

TRIBOULEY, Catherine M. 

LU, Dyung Aina 

LAL, Preeti G. 

WARREN, Bridget A. 

YANG, Junming 

WALIA, Narinder K. 

NGUYEN, Danniel B. 

GANDHI, Ameena R. 

LU, Yan 

ISON, Craig H. 

<120> DRUG METABOLIZING ENZYMES 

<130> PI^0313 PCT 

<140> To Be Assigned 
<141> Herewith 

<150> 60/254,308; 60/256,189; 60/257,713; 60/262,706 
60/266,020 

<151> 2000-12-08; 2000-12-15; 2000-12-21; 2001-01-19 
2001-02-02 



<160> 26 

<170> PERL Program 

<210> 1 
<211> 494 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 7483477CD1 

<400> 1 

Met Glu Leu Gly Gly Ala Val Thr He Phe Leu Ala Leu Arg Leu 
15 10 15 

Ser Cys Leu Leu lie Leu He Ala Trp Lys Arg Met Asp Lys Ala 

20 25 30 

Gly Lys Leu Pro Pro Gly Pro Thr Pro He Pro Phe Leu Gly Asn 

35 40 45 

Leu Leu Gin Val Arg Thr Asp Ala Thr Phe Gin Ser Phe Met Lys 
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50 55 60 

Leu Arg Glu Lys Tyr Sex Pro Val Phe Thr Val Oyr Met Gly Pro 
65 70 75 

Arg Pro Val Val Val Leu Cys Gly His Glu Ala Val Lys Glu Ala 
80 85 90 

Leu He Asp Gin Ala Asp Glu Phe Ser Gly Arg Gly Glu Leu Ala 
95 100 105 

Ser He Lys Gin Asn Phe Gin Gly His Gly Val Ala Leu Ala Asn 

110 115 120 

Gly Glu Arg Trp Arg He Leu Arg Arg Phe Ser Leu Thr He Leu 

125 130 135 

Arg Asp Phe Gly Met Gly Lys Gin Ser He Lys Glu Arg He Gin 

140 145 150 

Glu Glu Ala Ser Tyr Leu Leu Glu Glu Phe Gin Lys Thr Lys Gly 

155 160 165 

Ala Pro He Asp Pro He Phe Leu Leu Ser 3U:g Thr Val Ser Asn 

170 175 180 

Val He Ser Ser Val Val Phe Arg Ser Arg Phe Asp Tyr Glu Asp 

185 190 195 

Lys Gin Phe Leu Asn Leu Leu Arg Leu He Asn Glu Ser Phe He 

200 205 210 

Glu Met Ser Thr Pro Trp Ala Gin Leu Tyr Asp Met Tyr Ser Gly 

215 220 225 

He Met Gin Tyr Leu Pro Gly Arg His Asn Leu He Tyr Tyr Leu 

230 235 240 

Val Glu Glu Leu Lys Asp Phe He Ala Ser Arg Val Lys He Asn 

245 250 255 

Glu Ala Ser Phe Asp Pro Gin Asn Pro Arg Asp Phe He Asp Cys 

260 265 270 

Phe Leu He Lys Met His Gin Asp Lys Asn Asn Pro Arg Thr Glu 

275 280 285 

Phe Asn Leu Lys Asn Leu Val Leu Thr Thr Leu Asn Leu Phe Phe 

290 295 300 

Ala Gly Thr Glu Thr Veil Ser Ser Thr Leu Arg Tyr Gly Phe Leu 

305 310 315 

Leu Leu Met Lys His Pro Glu Val Glu Ala Lys He His Glu Glu 

320 325 330 

He Asn Gin Val He Gly Pro His Arg Leu Pro Arg Val Asp Asp 

335 340 345 

Arg Val Lys Met Pro Tyr Thr Asp Ala Val He His Glu He Gin 

350 355 360 

Arg Leu Val Asp He Val Pro Met Gly Val Pro His Asn Leu He 

365 370 375 

Arg Asp Thr Gin Phe Arg Gly Tyr Leu Leu Pro Lys Gly Thr Asp 

380 385 390 

Val Phe Pro Leu Leu Gly Ser Val Leu Lys Asp Pro Lys Tyr Phe 

395 400 405 

Arg Tyr Pro Asp Ala Phe Tyr Pro Gin His Phe Leu Asp Glu Gin 

410 415 420 

Gly Arg Phe Lys Lys Asn Glu Ala Phe Val Pro Phe Ser Ser Gly 

425 430 435 

Lys Arg He Cys Leu Gly Glu Ala Met Asp Arg Met Glu Leu Phe 

440 445 450 

Leu Tyr Phe Thr Ser Thr Leu Gin Asn Phe Ser Leu His Ser Leu 

455 460 465 

Val Pro Pro Val Asp He Asp He Thr Pro Lys Leu Ser Gly Phe 
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470 



475 



480 



Gly Asn lie Pro Pro Thr Tyr Glu Leu Cys Leu Val Ala Arg 



<210> 2 

<211> 243 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> incjyte ID No: 7485159CD1 

<400> 2 

Met Ser Gly Asp Ala Thr Arg Thr Leu Gly Lys Gly Ser Gin Pro 
15 10 15 

Pro Gly Pro Val Pro Glu Gly Leu He Arg He Tyr Ser Met Arg 
20 25 30 

Phe Cys Pro Tyr Ser His Arg Thr Arg Leu Val ,Leu Lys Ala Lys 
35 40 45 

Asp He Arg His Glu Val Val Asn He Asn Leu Arg Asn Lys Pro 
50 55 60 

Glu Trp Tyr Tyr Thr Lys His Pro Phe Gly His He Pro Val Leu 
65 70 75 

Glu Thr Ser Gin Cys Gin Leu He Tyr Glu Ser Val He Ala Cys 
80 85 90 

Glu Tyr Leu Asp Asp Ala Tyr Pro Gly Arg Lys Leu Phe Pro Tyr 
95 100 105 

Asp Pro Tyr Glu Arg Ala Arg Gin Lys Met Leu Leu Glu Leu Phe 
110 115 120 

Cys Lys Val Pro His Leu Thr Lys Glu Cys Leu Val Ala Leu Arg 
125 130 135 

Cys Gly Arg Glu Cys Thr Asn Leu Lys Ala Ala Leu Arg Gin Glu 
140 145 150 

Phe Ser Asn Leu Glu Glu He Leu Glu Tyr Gin Asn Thr Thr Phe 
155 160 165 

Phe Gly Gly Thr Cys He Ser Met He Asp Tyr Leu Leu Trp Pro 
170 175 180 

Trp Phe Glu Arg Leu Asp Val Tyr Gly He Leu Asp cys Val Ser 
185 190 195 

His Thr Pro Ala Leu Arg Leu Trp He Ser Ala Met Lys Trp Asp 
200 205 210 

Pro Thr Val Cys Ala Leu Leu Met Asp Lys Ser He Phe Gin Gly 
215 220 225 

Phe Leu Asn Leu Tyx Phe 6ln Asn Asn Pro Asn Ala Phe Asp Phe 
230 235 240 

Gly Leu Cys 



<210> 3 

<211> 873 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 



485 



490 
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<223>. Incyte ID No: 7485518CD1 
<400> 3 

Met Asn lieu lie Val Lys Leu Arg 

1 5 
Val Leu Leu Ala Thr Phe Cys Leu 
20 

Tyr Phe Leu Ty^r Ser Gly Tyr Lys 

35 

Thr Thr Ala Glu Ala Glu Cys Thr 
50 

Arg Ser Met Glu Leu Lys Thr Val 
65 

Thr Asp Pro Thr Val Leu Leu Phe 
80 

Leu Gly Gin Asp He Xle Ala He 
95 

Tyr His Met Val He Ala Pro Gly 
110 

Thr Asp Asn Gly Lys Gly Lys Tyr 
125 

He Leu Lys Tyr Val Ser Met Asp 
140 

Glu Lys Tyr Cys Val Glu Tyr Ser 
155 

Lys Ala Asn Glu Asn Ser Leu Pro 

170 

Pro Leu Asn Leu Phe Asn Asn Leu 
185 

Asn Pro Gin Ser Pro Leu Leu His 
200 

Glu Lys Gly Pro Leu Pro Gly Glu 

215 

Asn His Ser Thr Tyr Gin Pro Val 
230 . 

Glu Lys Ser Leu Ser Ser Leu Ser 

245 

Val He Gin Asp Leu Gly Leu His 
260 

Phe Gly Asn Asn Leu Asn Phe Trp 
275 

Asp Ala He Ser Phe Leu Ser Gly 
290 

Asp Arg Tyr He Leu Val Asp He 
305 

Glu Gly Thr Arg Met Asn Val Lys 

320 

Glu Thr Gin Asn Leu Leu Arg Thr 
335 

Asn Leu Gly Phe Ser Gly Lys Phe 

350 

Glu Asp Glu Gly Asp Asp Leu Leu 
365 

Trp Trp Phe Pro His Met Trp Ser 

380 

His Asn Glu Ser Ser Leu Val Glu 
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Arg Ser 
10 

Val Ser 
25 

Gin Glu 
40 

Asp He 
55 

Lys Pro 
70 

Val Glu 
85 

Leu Glu 

100 
Lys Gly 

115 
Thr Leu 

130 
Ser Trp 

145 
Val Ser 

160 
Ser Thr 

175 
Ala Leu 

190 
He Thr 

205 
Asp Trp 

220 
Leu Leu 

235 
Ser Lys 

250 
Asp Gly 

265 
Leu His 

280 
Lys Arg 

295 
Asp Asp 

310 
Asp Val 

325 
Gin Val 

340 
Tyr His 

355 
Leu Arg 

370 
His Met 

385 
Gin Met 



Phe Arg 
He Val 
Met Thr 
Lys He 
He Asp 
Ser Gin 
Ser Ser 
Asp He 
Val He 
Asn Arg 
He He 
Gin Leu 
Lys Asp 
Lys Ala 
Thr He 
Thr Glu 
Thr Leu 
He Gin 
Lys Leu 
Leu Thr 
He Phe 
Lys Val 
Ala Asn 
Thr Gly 
Ser Val 
Gin. Pro 
He Leu 



Thr Leu He 
15 

He Ser Ala 
30 

Leu He Glu 
45 

Leu Pro Tyr 
60 

Thr Ser Lys 
75 

Tyr Ser Gin 
90 

Arg Phe Gin 
105 

Pro Pro Leu 

120 

Tyr Glu Asn 
135 

Glu Leu Leu 
150 

Gly Phe His 
165 

Lys Gly Phe 
180 

Cys Phe Val 
195 

Pro Lys Val 
210 

Phe Gin Tyr 

225 

Leu Gin Thr 
240 

Phe Ala Thr 

255 

Arg Val Leu 
270 

He Phe He 
285 

Leu Ser Leu 
300 

Val Gly Lys 
315 

Ser Leu Phe 

330 

Phe Thr Phe 
345 

Thr Glu Glu 
360 

Asp Glu Phe 
375 

His Leu Phe 

390 

Asn Lys Glu 
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Phe Ala Leu 
Ala Pro His 
Ala Ala Trp 
Glu Tyr Pro 
His Asn Ser 
Thr His Thr 
Leu Asp Lys 
Leu Asn Pro 
Asn Asp Arg 
Val Gin Ser 
Gin Leu Ala 
Pro Leu Tarp 
Trp Ser Arg 
lie Gly Pro 
Leu Met His 
Phe Glu Glu 
lie Asp Trp 
Ser Asp Phe 
Val Ala Pro 
Leu Thr lie 
Gin His Gin 
Phe His Glu 
Arg Ala Leu 
His lie Glu 
Val Leu Asp 
Asp Met Val 
His Lys Thr 
Leu Leu Glu 



395 
Glu His Gly 

410 
His Ser Gly 

425 
Lys Lys Val 

440 
His Leu Lys 

455 
He Met Val 

470 
He Phe Tyr 

485 
Ser He Arg 

500 
He Ser He 

515 
Leu Gly Leu 

530 

Tirp Thr Asn 

545 
His Gin Tyr 

560 

Gin Asn Pro 

575 
Glu Lys Thr 

590 

Gin Lys Thr 
605 

Pro Ser He 
620 

Val Gin Phe 
635 

Tyr Met Asp 
650 

Tyr Phe Glu 

665 

Arg Arg Ala 
680 

Leu He Asn 

695 
Arg Ala His 

710 
Val He Thr 

725 
Gin Asn Arg 

740 
Arg Trp Leu 

755 
Gly Lys Leu 

770 
Gin Lys Phe 

785 
Leu Ala Phe 

800 
Gly Gly Lys 



He Pro He 
Val Tyr Pro 
Trp Gly He 
Pro Ala Arg 
Leu Pro Arg 
Lys Glu Tyr 
Gly Gly Glu 
Phe Met- Thr 
Tyr Thr Phe 
Leu Lys Leu 
Phe Glu Leu 
Cys Asp Asp 
Cys Asp His 
Gly Thr Thr 
He Ser Asn 
Phe Asn Gly 
Phe Phe Pro 
Lys Ser Ala 
Ala Ala Leu 
Pro Ala Asp 
Asp Asp Pro 
Ala Gly Ser 
Cys Leu Val 
Ser Ala Tyr 
Leu Arg Thr 
Leu Gly Val 
Asp Pro Lys 
Thr Lys Cys 



400 

Asn Met Gly 
415 

Val His He 
430 

Gin Val Thr 
445 

Tyr Arg Lys 
460 

Gin Thr Cys 
475 

Pro Gly Gly 
490 

Leu Phe Leu 
505 

His Leu Ser 
520 

Val Asn Leu 

535 

Gin Thr Leu 
550 

Phe Pro Glu 
565 

Lys Arg His 
580 

Leu Pro Lys 

595 

Ala Leu Tyr 
610 

Leu Pro Ser 
625 

Asn Asn Tyr 
640 

He Pro Ser 
655 

Asn Tyr Phe 
670 

Leu Pro Lys 
685 

Arg Ala Tyr 
700 

Val Ala Leu 
715 

Asp Ala Ser 
730 

Pro Gly Trp 
745 

His Ala Asn 
760 

Glu Pro Ala 
775 

Thr Asn Thr 
790 

Lys Gly Phe 
805 

Leu Gly Lys 



405 

Tyr Ala Val 
420 

Gin Leu Tyr 
435 

Ser Thr Glu 
450 

Gly Phe He 
465 

Gly Leu Phe 
480 

Pro Gin Glu 
495 

Thr He Leu 
510 

Asn Tyr Gly 
525 

Val Asn Phe 

540 

Pro Pro Val 
555 

Gin Lys Asp 
570 

Lys Asp He 
585 

Phe Leu Val 
600 

Leu Phe Leu 
615 

Pro Lys Thr 
630 

His Lys Gly 
645 

Asn Thr Thr 
660 

Asp Ser Glu 

675 

Ala Lys Val 
690 

Ser Trp Tyr 
705 

Lys Tyr Thr 
720 

Ser Lys Leu 
735 

Tyr Ala Thr 

750 

Gin Val Leu 
765 

Lys Val Met 
780 

He Asp Tyr 
795 

Trp Cys Gin 
810 

Ser Lys Gly 
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815 



820 



825 



Arg Lys Tyr Pro Glu Met Asp Leu Gin Ser Arg Ala Phe Leu Lys 
830 835 840 

Asp Tyr Tyr Arg Asp His Asn lie Glu Leu Ser Lys Leu Leu Tyr 
845 850 855 

Lys Met Gly Gin Thr Leu Pro Thr Trp Leu Arg Glu Asp Phe Gin 
860 865 870 

Asn Thr Arg 



<210> 4 

<211> 581 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2860635CD1 

<400> 4 

Met Trp Gly Arg Thr Ala Arg Arg Arg Cys Pro Arg Glu Leu Arg 
15 10 15 

Arg Gly Arg Glu Ala Leu Leu Val Leu Leu Ala Leu Leu Ala Leu 
20 25 30 

Ala Gly Leu Gly Ser Val Leu Arg Ala Gin Arg Gly Ala Gly Ala 
35 40 45 

Gly Ala Ala Glu Pro Gly Pro Pro Arg Thr Pro Arg Pro Gly Arg 
50 55 60 

Arg Glu Pro Val Met Pro Arg Pro Pro Val Pro Ala Asn Ala Leu 
65 70 75 

Gly Ala Arg Gly Glu Ala Val Arg Leu Gin Leu Gto Gly Glu Glu 
80 85 90 

Leu Arg Leu Gin Glu Glu Ser Val Arg Leu His Gin lie Asn lie 
95 100 105 

Tyr Leu Ser Asp Arg lie Ser Leu His Arg Arg Leu Pro Glu Arg 

110 115 120 

Trp Asn Pro Leu Cys Lys Glu Lys Lys Tyr Asp Tyr Asp Asn Leu 

125 130 . 135 

Pro Arg Thr Ser Val lie lie Ala Phe Tyr Asn Glu Ala Trp Ser 

140 145 150 

Thr Leu Leu Arg Thr Val Tyr Ser Val Leu Glu Thr Ser Pro Asp 

155 160 165 

lie Leu Leu Glu Glu Val lie Leu Val Asp Asp Tyr Ser Asp Arg 

170 175 180 

Glu His Leu Lys Glu Arg Leu Ala Asn Glu Leu Ser Gly Leu Pro 

185 190 195 

Lys Val Arg Leu lie Arg Ala Asn Lys Arg Glu Gly Leu Val Arg 

200 205 210 

Ala Arg Leu Leu Gly Ala Ser Ala Ala Arg Gly Asp Val Leu Thr 

215 220 225 

Phe Leu Asp Cys His Cys Glu Cys His Glu Gly Trp Leu Glu Pro 

230 235 240 

Leu Leu Gin Arg He His Glu Glu Glu Ser Ala Val Val Cys Pro 

245 250 255 

Val He Asp Val He Asp Trp Asn Thr Phe Glu Tyr Leu Gly Asn 



260 



265 



270 
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Ser Gly Glu Pro Gin He Gly Gly Phe Asp Trp Arg Leu Val Phe 

275 280 285 

Thr Trp His Thr Val Pro Glu Arg Giu Arg He Arg Met Gin Ser 

290 295 300 

Pro Val Asp Val He Arg Ser Pro Thr Met Ala Gly Gly Leu Phe 

305 310 315 

Ala Val Ser Lys Lys Tyr Phe Glu Tyr Leu Gly Ser Tyr Asp Thr 

320 325 330 

Gly Met Glu Val Trp Gly Gly Glu Asn Leu Glu Phe Ser Phe Arg 

335 340 345 

He Trp Gin Cys Gly Gly Val Leu Glu Thr His Pro Cys Ser His 

350 355 360 

Val Gly His Val Phe Pro Lys Gin Ala Pro Tyr Ser Arg Asn Lys 

365 370 375 

Ala Leu Ala Asn Ser Val Arg Ala Ala Glu Val Trp Met Asp Glu 

380 385 390 

Phe Lys Glu Leu Tyr Tyr His Arg Asn Pro Arg Ala Arg Leu Glu 

395 400 405 

Pro Phe Gly Asp Val Thr Glu Arg Lys Gin Leu Arg Asp Lys Leu 

410 415 420 

Gin Cys Lys Asp Phe Lys Trp Phe Leu Glu Thr Val Tyr Pro Glu 

425 430 435 

Leu His Val Pro Glu Asp Arg Pro Gly Phe Phe Gly Met Leu Gin 

440 445 450 

Asn Lys Gly Leu Thr Asp Tyr Cys Phe Asp Tyr Asn Pro Pro Asp 

455 460 465 

Glu Asn Gin He Val Gly His Gin Val He Leu Tyr Leu Cys His 

470 475 480 

Gly Met Gly Gin Asn Gin Phe Phe Glu Tyr Thr Ser Gin Lys Glu 

485 490 495 

He Arg Tyr Asn Thr His Gin Pro Glu Gly Cys He Ala Val Glu 

500 505 510 

Ala Gly Met Asp Thr Leu He Met His Leu Cys Glu Glu Thr Ala 

515 520 525 

Pro Glu Asn Gin Lys Phe He Leu Gin Glu Asp Gly Ser Leu Phe 

530 535 540 

His Glu Gin Ser Lys Lys Cys Val Gin Ala Ala Arg Lys Glu Ser 

545 550 555 

Ser Asp Ser Phe Val Pro Leu Leu Arg Asp Cys Thr Asn Ser Asp 

560 565 570 

His Gin Lys Trp Phe Phe Lys Glu Arg Met Leu 

575 580 



<210> 5 

<211> 835 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 2530615CD1 



<400> 5 

Met Trp Leu Leu Ala Leu Cys Leu Val Gly Leu Ala Gly Ala Gin 
15 10 15 

Arg Gly Gly Gly Gly Pro Gly Gly Gly Ala Pro Gly Gly Pro Gly 
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20 25 30 

Leu Gly Leu Gly Ser Leu Gly Glu Glu Arg Phe Pro Val Val Asn 
35 40 45 

Thr Ala Tyr Gly Arg Val Arg Gly Val Arg Arg Glu Leu Asn Asn 
50 55 60 

Glu lie Leu Gly Pro Val Val Gin Phe Leu Gly Val Pro Tyr Ala 
65 70 75 

Thr Pro Pro Leu Gly Ala Arg Arg Phe Gin Pro Pro Glu Ala Pro 
80 85 90 

Ala Ser Tirp Pro Gly Val Arg Asn Ala Thr Thr Leu Pro Pro Ala 
95 100 105 

Cys Pro Gin Asn Leu His Gly Ala Leu Pro Ala lie Met Leu Pro 

110 115 120 

Val Trp Phe Thr Asp Asn Leu Glu Ala Ala Ala Thr Tyr Val Gin 

125 130 135 

Asn Gin Ser Glu Asp Cys Leu Tyr Leu Asn Leu Tyr Val Pro Thr 

140 145 150 

Glu Asp Gly Pro Leu Thr Lys Lys Arg Asp Glu Ala Thr Leu Asn 

155 160 165 

Pro Pro Asp Thr Asp lie Arg Asp Pro Gly Lys Lys Pro Val Met 

170 175 180 

Leu Phe Leu His Gly Gly Ser Tyr Met Glu Gly Thr Gly Asn Met 

185 190 195 

Phe Asp Gly Ser Val Leu Ala Ala Tyr Gly Asn Val lie Val Ala 

200 205 210 

Thr Leu Asn Tyr Arg Leu Gly Val Leu Gly Phe Leu Ser Thr Gly 

215 220 225 

Asp Gin Ala Ala Lys Gly Asn Tyr Gly Leu Leu Asp Gin lie Gin 

230 235 240 

Ala Leu Arg Trp Leu Ser Glu Asn lie Ala His Phe Gly Gly Asp 

245 250 255 

Pro Glu Arg lie Thr lie Phe Gly Ser Gly Ala Gly Ala Ser Cys 

260 265 270 

Val Asn Leu Leu lie Leu Ser His His Ser Glu Gly Leu Phe Gin 

275 280 285 

Lys Ala lie Ala Gin Ser Gly Thr Ala lie Ser Ser Trp Ser Val 

290 295 300 

Asn Tyr Gin Pro Leu Lys Tyr Thr Arg Leu Leu Ala Ala Lys Val 

305 310 315 

Gly Cys Asp Arg Glu Asp Ser Ala Glu Ala Val Glu Qys Leu Arg 

320 325 330 

Arg Lys Pro Ser Arg Glu Leu Val Asp Gin Asp Val Gin Pro Ala 

335 340 345 

Arg Tyr His lie Ala Phe Gly Pro Val Val Asp Gly Asp Val Val 

350 355 360 

Pro Asp Asp Pro Glu lie Leu Met Gin Gin Gly Glu Phe Leu Asn 

365 370 375 

Tyr Asp Met Leu lie Gly Val Asn Gin Gly Glu Gly Leu Lys Phe 

380 385 390 

Val Glu Asp Ser Ala Glu Ser Glu Asp Gly Val Ser Ala Ser Ala 

395 400 405 

Phe Asp Phe Thr Val Ser Asn Phe Val Asp Asn Leu Tyr Gly Tyr 

410 415 420 

Pro Glu Gly Lys Asp Val Leu Arg Glu Thr lie Lys Phe Met Tyr 

425 430 435 

Thr Asp Trp Ala Asp Arg Asp Asn Gly Glu Met Arg Arg Lys Thr 
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440 445 450 

Leu Leu Ala Leu Phe Thr Asp His Gin Trp Val Ala Pro Ala Val 

455 460 465 

Ala Thr Ala Lys Leu His Ala Asp Tyr Gin Ser Pro Val Tyr Phe 

470 475 480 

Tyx Thr Phe Tyr His His Cys Gin Ala Glu Gly Arg Pro Glu Trp 

485 490 495 

Ala Asp Ala Ala His Gly Asp Glu Leu Pro Tyr Val Phe Gly Val 

500 505 510 

Pro Met Val Gly Ala Thr Asp Leu Phe Pro Cys Asn Phe Ser Lys 

515 520 525 

Asn Asp Val Met Leu Ser Ala Val Val Met Thr Tyr Trp Thr Asn 

530 535 540 

Phe Ala Lys Thr Gly Asp Pro Asn Gin Pro Val Pro Gin Asp Thr 

545 550 555 

Lys Phe lie His Thr Lys Pro Asn Arg Phe Glu Glu Val Val Trp 

560 565 570 

Ser Lys Phe Asn Ser Lys Glu Lys Gin Tyr Leu His lie Gly Leu 

575 580 585 

Lys Pro Arg Val Arg Asp Asn Tyr Arg Ala Asn Lys Val Ala Phe 

590 595 600 

Trp Leu Glu Leu Val Pro His Leu His Asn Leu His Thr Glu Leu 

605 610 615 

Phe Thr Thr Thr Thr Arg Leu Pro Pro Tyr Ala Thr Arg Trp Pro 

620 625 630 

Pro Arg Pro Pro Ala Gly Ala Pro Gly Thr Arg Arg Pro Pro Pro 

635 640 645 

Pro Ala Thr Leu Pro Pro Glu Pro Glu Pro Glu Pro Gly Pro Arg 

650 655 660 

Ala Tyr Asp Arg Phe Pro Gly Asp Ser Arg Asp Tyr Ser Thr Glu 

665 670 675 

Leu Ser Val Thr Val Ala Val Gly Ala Ser Leu Leu Phe Leu Asn 

680 685 690 

lie Leu Ala Phe Ala Ala Leu Tyr Tyr Lys Arg Asp Arg Arg Gin 

695 700 705 

Glu Leu Arg Cys Arg Arg Leu Ser Pro Pro Gly Gly Ser Gly Ser 

710 715 720 

Gly Val Pro Gly Gly Gly Pro Leu Leu Pro Ala Ala Gly Arg Glu 

725 730 735 

Leu Pro Pro Glu Glu Glu Leu Val Ser Leu Gin Leu Lys Arg Gly 

740 745 750 

Gly Gly Val Gly Ala Asp Pro Ala Glu Ala Leu Arg Pro Ala Cys 

755 760 765 

Pro Pro Asp Tyr Thr Leu Ala Leu Arg Arg Ala Pro Asp Asp Val 

770 775 780 

Pro Leu Leu Ala Pro Gly Ala Leu Thr Leu Leu Pro Ser Gly Leu 

785 790 795 

Gly Pro Pro Pro Pro Pro Pro Pro Pro Ser Leu His Pro Phe Gly 

800 805 810 

Pro Phe Pro Pro Pro Pro Pro Thr Ala Thr Ser His Asn Asn Thr 

815 820 825 

Leu Pro His Pro His Ser Thr Thr Arg Val 

830 835 

<210> 6 
<2U> 558 
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<212> PRT . 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 3883906CD1 

<400> 6 



Met 


Arg 


Lys 


He 


Arg 


Ala 


Asn 


Ala He Ala He 


Leu Thr Val Ala 


1 








5 






10 


15 


Trp 


He 


Leu 


Gly 


Thr 


Phe 


Tyr 


Tyr Leu Trp Gin Asp Asn Arg Ala 










20 






25 


30 


His 


Ala 


Ala 


Ser 


Ser 


Gly 


Gly 


Arg Gly Ala Gin Arg Ala Gly Arg 










35 






40 


45 


Arg 


Ser 


Glu 


Gin 


Leu 


Arg 


Glu 


Asp Arg Thr He 


Pro Leu He Val 










50 






55 


60 


Thr 


Gly 


Thr 


Pro 


Ser 


Lys 


Gly 


Phe Asp Glu Lys Ala Tyr Leu Ser 










65 






70 


75 


Ala 


Lys 


Gin 


Leu 


Lys 


Ala 


Gly 


Glu Asp Pro Tyr Arg Gin His Ala 










80 






85 


90 


Phe 


Asn 


Gin 


Leu 


Glu 


Ser 


Asp 


Lys Leu Ser Pro Asp Arg Pro He 










95 






100 


105 


Arg 


Asp 


Thr 


Arg 


His 


Tyr 


Ser 


Cys Pro Ser Val 


Ser Tyr Ser Ser 










110 






115 


120 


Asp 


Leu 


Pro 


7U.a 


Thr 


Ser 


Val 


He He Thr Phe 


His Asn Glu Ala 










125 






130 


135 


Arg 


Ser 


Thr 


Leu 


Leu 


Arg 


Thr 


Val Lys Ser Val 


Leu Asn Arg Thr 










140 






145 


150 


Pro 


Ala 


Asn 


Leu 


He 


Gin 


Glu 


He He Leu Val 


Asp Asp Phe Ser 










155 






160 


165 


Ser 


Asp 


Pro 


Glu 


Asp 


Cys 


Leu 


Leu Leu Thr Arg 


He Pro Lys Val 










170 






175 


180 


Lys 


Cys 


Leu 


Arg 


Asn 


Asp 


Arg 


Arg Glu Gly Leu 


He Arg Ser Arg 










185 






190 


195 


Val 


Arg 


Gly 


Ala 


Asp 


Val 


Ala 


Ala Ala Thr Val 


Leu Thr Phe Leu 










200 






205 


210 


Asp 


Ser 


His 


Cys 


Glu 


Val 


Asn 


Thr Glu Trp Leu 


Pro Pro Met Leu 










215 






220 


225 


Gin 


Arg 


Val 


Lys 


Glu 


Asp 


His 


Thr Arg Val Val 


Ser Pro He He 










230 






235 


240 


Asp 


Val 


He 


Ser 


Leu 


Asp 


Asn 


Phe Ala Tyr Leu Ala Ala Ser Ala 










245 






250 


255 


Asp 


Leu 


Arg 


Gly 


Gly 


Phe 


Asp 


Trp Ser Leu His 


Phe Lys Trp Glu 










260 






265 


270 


Gin 


lie 


Pro 


Leu 


Glu 


Gin 


Lys 


Met Thr Arg Thr Asp Pro Thr Arg 










275 






280 


285 


Pro 


He 


Arg 


Thr 


Pro 


Val 


He 


Ala Gly Gly He 


Phe Val He Asp 










290 






295 


300 


Lys 


Ser 


Trp 


Phe 


Asn 


His 


Leu 


Gly Lys Tyr Asp Ala Gin Met Asp 










305 






310 


315 


lie 


Trp 


Gly 


Gly 


Glu 


Asn 


Phe 


Glu Leu Ser Phe Arg Val Trp Met 










320 






325 


330 


Cys 


Gly 


Gly 


Ser 


Leu 


Glu 


He 


Val Pro Cys Ser Arg Val Gly His 










335 






340 


345 


Val 


Phe 


Arg 


Lys 


Arg 


His 


Pro 


Tyr Asn Phe Pro Glu 6ly Asn Ala 










350 






355 


360 
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Leu Thr Tyr lie Arg Asn Thr Lys Arg Thr Ala Glu Val Trp Met 

365 370 375 

Asp Glu OVr Lys Gin Tyr Tyr Tyr Glu Ala Arg Pro Ser Ala lie 

380 385 390 

Gly Lys Ala Phe Gly Ser Val Ala Thr Arg lie Glu Gin Arg Lys 

395 400 405 

Lys Met Asn Cys Lys Ser Phe Arg Trp Tyr Leu Glu Asn Val Tyr 

410 415 420 

Pro Glu Leu Thr Val Pro Val Lys Glu Ala Leu Pro Gly He He 

425 430 435 

Lys Gin Gly Val Asn cys Leu Glu Ser Gin Gly Gin Asn Thr Ala 

440 445 450 

Gly Asp Phe Leu Leu Gly Met Gly He Cys Arg Gly Ser Ala Lys 

455 460 465 

Asn Pro Gin Pro Ala Gin Ala Trp Leu Phe Ser Asp His Leu lie 

470 475 480 

Gin Gin Gin Gly Lys Cys Leu Ala Ala Thr Ser Thr Leu Met Ser 

485 490 495 

Ser Pro Gly Ser Pro Val He Leu Gin Met Cys Asn Pro Arg Glu 

500 505 510 

Gly Lys Gin Lys Trp Arg TVrg Lys Gly Ser Phe He Gin His Ser 

515 520 525 

Val Ser Gly Leu Cys Leu Glu Thr Lys Pro Ala Gin Leu Val Thr 

530 535 540 

Ser Lys Cys Gin Ala Asp Ala Gin Ala Gin Gin Trp Gin Leu Leu 

545 550 555 

Pro His Thr 



<210> 7 
<211> 277 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Xncyte ID No: 7473644CD1 

<400> 7 

Met Ser Ser Cys Ser Arg Val Ala Leu Val Thr Gly Ala Asn Lys 
1 5 10 • 15 

Gly He Gly Phe Ala He Thr Arg Asp Leu Cys Arg Lys Phe Ser 

20 25 30 

Gly Asp Val Val Leu Thr Ala Arg Asp Glu Ala Arg Gly Arg Ala 

35 40 45 

Ala Val Gin Gin Leu Gin Ala Glu Gly Leu Ser Pro Arg Phe His 

50 55 60 

Gin Leu Asp He Asp Asp Pro Gin Ser He Arg Ala Leu Arg Asp 

65 70 75 

Phe Leu Arg Lys Glu Tyr Gly Gly Leu Asn Val Leu Val Asn Asn 

8a 85 - 90 

Ala Gly He Ala Phe Arg Met Asp Asp Pro Thr Pro Phe Asp He 

95 100 105 

Gin Ala Glu Val Thr Leu Lys Thr Asn Phe Phe Ala Thr Arg Asn 
110 115 120 

Val Cys Thr Glu Leu Leu Pro He Met Lys Pro His Gly Arg Val 
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125 130 135 

Val Asn lie Ser Sex Leu lie 5er Leu Glu Ala Leu Lys Asn Cys 

140 145 • 150 

Ser Leu Glu Leu Gin Gin Lys Phe Arg Ser Glu Thr He Thr Glu 

155 160 165 

Glu Glu Leu Val Gly Leu Met Asn Lys Phe Val Glu Asp Thr Lys 

170 175 180 

Lys Gly Val His Ala Lys Glu Gly Trp Pro Asn Ser Ala Tyr Gly 

185 190 195 

Val Ser Lys He Gly Val Thr Val Leu Ser Arg He Leu Ala. Arg 

200 205 210 

Lys Leu Asn Glu Gin Arg Arg Gly Asp Lys He Leu Leu Asn Ala 

215 220 225 

Cys Cys Pro Gly Trp Val Arg a?hr Asp Met Ala Gly Pro Gin Ala 

230 235 240 

Thr Lys Ser Pro Glu Glu Gly Ala Glu Thr Pro Val Tyr Leu Ala 

245 250 255 

Leu Leu Pro Pro Asp Ala Glu Gly Pro His Gly Gin Phe Val Gin 

260 265 270 

Asp Lys Lys Val Glu Gin Trp 

275 

<210> 8 
<211> 821 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7485303CD1 

<400> 8 

Met Val Asn Gly Leu Phe Leu Phe Ser Leu Tyr Thr Pro Gin Ala 
15 10 15 

Pro Ser Met Ser Ser Lys Asn Asn Leu Glu Ala Phe Arg Arg Pro 
20 25 30 

Glu Gin Arg Gly His Arg Ala Ala Arg Ser Gin Leu Arg Ser Asp 
35 40 45 

Ala Ala Ser Ala Arg Arg Gly Phe Val Ala Arg Thr Arg Arg Glu 
50 55 60 

Leu Gly Pro Gly Arg Cys Val Arg Pro Arg Pro Arg Leu Leu Ala 
65 70 75 

Pro Thr Ser Leu Asn Asn Asp His Arg Gin Pro Arg Trp Glu Asp 
80 85 90 

Asn Gly Gin Lys Miet Pro Gly His Pro Glu Thr Leu Ser Ala Phe 
95 100 105 

Cys Pro Gly His Arg Leu Leu His Ser Leu Leu Pro Asp Asp Val 
110 115 120 

Ser Gly Arg Pro Val Leu Gly Pro Ala Pro Gly Gly Leu Ala He 
125 130 135 

His His Pro His Arg Arg Met Pro Pro Gly Ala Gly Leu Met Glu 
140 145 150 

Arg He Gin Ala He Ala Gin Asn Val Ser Asp He Ala Val Lys 
155 160 165 

Val Asp Gin He Leu Arg His Ser Leu Leu Leu His Ser Lys Val 
170 175 180 
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Ser Glu Gly Arg Arg Asp Gin Cys Glu Ala Pro Ser Asp Pro Lys 

185 190 195 

Phe Pro Asp Cys Ser Gly Lys Val Glu Trp Met Arg Ala Arg Trp 

200 205 210 

Thr Ser Asp Pro Cys Tyr Ala Phe Phe Gly Val Asp Gly Thr Glu 

215 220 225 

Cys Ser Phe Leu lie Tyr Leu Ser Glu Val Glu Trp Phe Cys Pro 

230 235 240 

Pro Leu Pro Trp Arg Asn Gin Thr Ala Ala Gin Arg Ala Pro Lys 

245 250 255 

Pro Leu Pro Lys Val Gin Ala Val Phe Arg Ser Asn Leu Ser His 

260 265 270 

Leu Leu Asp Leu Met Gly Ser Gly Lys Glu Ser Leu lie Phe Met 

275 280 285 

Lys Lys Arg Thr Lys Arg Leu Thr Ala Gin Trp Ala Leu Ala Ala 

290 295 300 

Gin Arg Leu Ala Gin Lys Leu Gly Ala Thr Gin Arg Asp Gin Lys 

305 310 315 

Gin He Leu Val His He Gly Phe Leu Thr Glu Glu Ser Gly Asp 

320 325 330 

Val Phe Ser Pro Arg Val Leu Lys Gly Gly Pro Leu Gly Glu Met 

335 340 345 

Val Gin Trp Ala Asp He Leu Thr Ala Leu Tyr Val Leu Gly His 

350 355 360 

Gly Leu Arg Val Thr Val Ser Leu Lys Glu Leu Gin Ser Asn Leu 

365 370 375 

Gly Val Pro Pro Gly Arg Gly Ser Cys Pro Leu Thr Met Pro Leu 

380 385 390 

Pro Phe Asp Leu He Tyr Thr Asp Tyr His Gly Leu Gin Gin Met 

395 400 405 

Lys Arg His Met Gly Leu Ser Phe Lys Lys Tyr Arg Cys Arg He 

410 415 420 

Arg Val He Asp Thr Phe Gly Thr Glu Pro Ala Tyr Asn His Glu 

425 430 435 

Glu Tyr Ala Thr Leu His Gly Tyr Arg Thr Asn Trp Gly Tyr Trp 

440 445 450 

Asn Leu Asn Pro Lys Gin Phe Met Thr Met Phe Pro His Thr Pro 

455 460 465 

Asp Asn Ser Phe Met Gly Phe Val Ser Glu Glu Leu Asn Glu Thr 

470 475 480 

Glu Lys Arg Leu He Lys Gly Gly Lys Ala Ser Asn Met Ala Val 

485 490 495 

Val Tyr Gly Lys Glu Ala Ser He Trp Lys Gly Lys Glu Lys Phe 

500 505 510 

Leu Gly He Leu Asn Lys Tyr Met Glu He His Gly Thr Val Tyr 

515 520 525 

Tyc Glu Ser Gin Arg Pro Pro Glu Val Pro Ala Phe Val Lys Asn 

530 535 540 

His Gly Leu Leu Pro Gin Pro Glu Phe Gin Gin Leu Leu Arg Lys 

545 550 555 

Ala Lys Leu Phe He Gly Phe Gly Phe Pro Tyr Glu Gly Pro Ala 

560 565 • 570 

Pro Leu Glu Ala He Ala Asn Gly Cys He Phe Leu Gin Ser Arg 

575 580 585 

Phe Ser Pro Pro His Ser Ser Leu Asn His Glu Phe Phe Arg Gly 

590 595 600 
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Lys Pro Thr Ser Arg Glu Val Phe Ser Gin His Pro Tyx Ala Glu 

605 610 615 

Asn Phe He Gly Lys Pro His Val Trp Thr Val Asp Tyz Asn Asn 

620 625 630 

Ser Glu Glu Phe Glu Ala Ala He Lys Ala He Met Arg Thr Gin 

635 640 645 

Val Asp Pro Tyr Leu Pro Tyr Glu Tyr Thr cys Glu Gly Met Leu 

650 655 660 

Glu Arg He His Ala Tyr He Gin His Gin Asp Phe Cys Arg Ala 

665 670 675 

Pro Asp Pro Ala Leu Pro Glu Ala His Ala Pro Gin Ser Pro Phe 

680 685 690 

Val Leu Ala Pro Asn Ala Thr His Leu Glu Trp Ala Arg Asn Thr 

695 700 705 

Ser Leu Ala Pro Gly Ala Trp Pro Pro Ala His Ala Leu Arg Ala 

710 715 720 

Trp Leu Ala Val Pro Gly Arg Ala Cys Thr Asp Thr Cys Leu Asp 

725 730 735 

His Gly Leu He Cys Glu Pro Ser Phe Phe Pro Phe Leu Asn Ser 

740 745 750 

Gin Asp Ala Phe Leu Lys Leu Gin Val Pro Cys Asp Ser Thr Glu 

755 760 765 

Ser Glu Met Asn His Leu Tyr Pro Ala Phe Ala Gin Pro Gly Gin 

770 775 780 

Glu Cys Tyr Leu Gin Lys Glu Pro Leu Leu Phe Ser Cys Ala Gly 

785 790 795 

Ser Asn Thr Lys Tyr Arg Arg Leu Cys Pro cys Arg Asp Phe Arg 

800 805 810 

Lys Gly Gin Val Ala Leu Cys Gin Gly Cys Leu 

815 820 

<210> 9 
<211> 159 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 6970969CD1 

<400> 9 

Met Thr He Glu He Arg Pro Ala Val Pro Ser Asp Ala Ala Gin 
1 5 10 15 

He Leu Thr Phe He Thr Glu Leu Ala Glu Tyr Glu Lys Ala Arg 

20 25 30 

His Glu Val He Ala Ser Val Val Asp He Glu Arg Ser Leu Phe 

35 40 45 

Ser Glu Gly Ala Thr Ala His Gly Leu He Cys Leu Arg Asp Gly 

50 55 60 

Leu Pro He Gly Phe Ala Val Phe Phe Phe Ser Tyr Ser Thr Trp 

65 70 75 

Leu Gly Ser Asn Cys Leu Tyr Leu Glu Asp Leu lyr He Asn Pro 

80 85 90 

Glu Gin Arg Gly Gly Gly Ala Gly Lys Lys Leu Leu Arg His Leu 

95 100 105 

Ala Lys He Ala Phe Asp Asn Gly Cys Gly Arg Phe Glu Trp Ser 
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110 



115 



120 



Val Leu Asp Trp Asn Glu Pro Ala He Ala Phe Tyr Lys Ser He 
125 130 135 

Gly Ala Gin Pro Gin Glu Glu Trp Val Arg Tyr Arg Met Glu Gly 
140 145 150 

Asp Ala Leu Arg Asp Phe Ala Leu Gly 



<210> 10 

<211> 642 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> inisc_feature 

<223> Incyte ID No: 6538080CD1 

<400> 10 

Met Val Val Val Pro Phe Thr He Pro Phe Asp Ser Ser Val Trp 
15 10 15 

Leu Leu Gin Lys Leu Asn Ser Pro Trp Arg Met Thr Val Asp Tyr 
20 25 30 

His Thr Leu Asn Gin Ala Val Thr Leu He He Ala Ala Val Leu 
35 40 45 

Gly Val Ala Ser Leu Leu Glu Gin Gly Phe Ser Arg Ala He Trp 
50 55 60 

Glu Asn Arg Glu Ser Leu Ser Lys Val Cys Val Ser Gin Gly Pro 
65 70 75 

Ser Arg Pro Leu Ala Cys Ala Thr Asn Gly Asp He Lys Val Gin 
80 85 90 

Gly Gly Pro Ser Ala Glu Gly Pro Gin Arg Asn Thr Arg Leu Gly 
95 100 105 

Trp He Gin Gly Lys Gin Val Thr Val Leu Gly Ser Pro Val Pro 

110 115 120 

Val Asn Val Phe Leu Gly Val Pro Phe Ala Ala Pro Pro Leu Gly 

125 130 135 

Ser Leu Arg Phe Thr Asn Pro Gin Pro Ala Ser Pro Trp Asp Asn 

140 145 150 

Leu Arg Glu Ala Thr Ser Tyr Pro Asn Leu Cys Leu Gin Asn Ser 

155 160 165 

Glu Trp Leu Leu Leu Asp Gin His Met Leu Lys Val His Tyr Pro 

170 175 180 

Lys Phe Gly Val Ser Glu Asp Cys Leu Tyr Leu Asn He Tyx Ala 

185 190 195 

Pro Ala His Ala Asp Thr Gly Ser Lys Leu Pro Val Leu Val Trp 

200 205 210 

Phe Pro Gly Gly Ala Phe Lys Thr Gly Ser Ala Ser He Phe Asp 

215 220 225 

Gly Ser Ala Leu Ala Ala Tyr Glu Asp Val Leu Val Val Val Val 

230 235 240 

Gin Tyr Arg Leu Gly He Phe Gly Phe Phe Thr Thr Trp Asp Gin 

245 250 255 

His Ala Pro Gly Asn Trp Ala Phe Lys Asp Gin Val Ala Ala Leu 

260 265 270 

Ser Trp Val Gin Lys Asn He Glu Phe Phe Gly Gly Asp Pro Ser 



155 



275 



280 



285 
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Ser Val Thr He Phe Gly Glu Ser Ala Gly Ala He Ser Val Ser 

290 295 300 

ser Leu He Leu Ser Pro Met Ala Lys Gly Leu Phe His Lys Ala 

305 310 315 

lie Met Glu Ser Gly Val Ala He He Pro Tyr Leu Glu Ala His 

320 325 330 

Asp Tyr Glu Lys Ser Glu Asp Leu Gin Val Val Ala His Phe Cys 

335 340 345 

Gly Asn Asn Ala Ser Asp Ser Glu Ala Leu Leu Arg Cys Leu Arg 

350 355 360 

Thr Lys Pro Ser Lys Glu Leu Leu Thr Leu Ser Gin Lys Thr Lys 

365 370 375 

Ser Phe Thr Arg Val Val Asp Gly Ala Phe Phe Pro Asn Glu Pro 

380 . 385 390 

Leu Asp Leu Leu Ser Gin Lys Ala Phe Lys Ala He Pro Ser He 

395 400 405 

He Gly Val Asn Asn His Glu Cys Gly Phe Leu Leu Pro Met Lys 

410 415 420 

Glu Ala Pro Glu Val Leu Ser Gly Ser Asn Lys Ser Leu Ala Leu 

425 430 435 

His Leu He Gin Asn He Leu His He Pro Pro Gin Tyr Leu His 

440 445 450 

Leu Val Ala Asn Glu Tyr Phe His Asp Lys His Ser Leu Thr Glu 

455 460 465 

He Arg Asp Ser Leu Leu Asp Leu Leu Gly Asp Val Phe Phe Val 

470 . 475 480 

Val Pro Ala Leu He Thr Ala Arg Tyr His Arg Asp Ala Gly Ala 

485 490 495 

Pro Val OVr Phe Tyr Glu Phe Arg His Arg Pro Gin Cys Phe Glu 

500 505 510 

Asp Thr Lys Pro Ala Phe Val Lys Ala Asp His Ala Asp Glu Val 

515 520 525 

Arg Phe Val Phe Gly Gly Ala Phe Leu Lys Gly Asp He Val Met 

530 535 540 

Phe Glu Gly Ala Thr Glu Glu Glu Lys Leu Leu Ser Arg Lys Met 

545 550 555 

Met Lys Tyr Trp Ala Thr Phe Ala Arg Thr Gly Asn Pro Asn Gly 

560 565 570 

Asn Asp Leu Phe Leu Trp Pro Ala Tyx Asn Leu Thr Glu Gin Tyr 

575 580 585 

Leu Gin Leu Asp Leu Asn Met Ser Leu Gly Gin Arg Leu Lys Glu 

590 595 600 

Pro Arg Val Glu Phe Trp Thr Ser Thr He Pro Leu He Leu Ser 

605 610 615 

Ala Ser Asp Met Leu His Ser Pro Leu Ser Ser Leu Thr Phe Leu 

620 625 630 

Ser Leu Leu Gin Pro Phe Phe Phe Phe Cys Ala Pro 

635 640 



<210> 11 

<211> 109 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 
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<223> Incyte ID No: 55048919CD1 
<400> 11 

Met His Phe His Ala Thr Gly Asp Cys Sex Asp Gin Gly Phe Gin 
15 10 15 

Lys Ser Gly Ala His lie Asn His Glu Asp His Lys Thr Pro His 

20 25 30 

Gly Leu Leu Asn Pro Glu Gly Pro Asp Phe Gly Glu Leu Pro Asn 

35 40 45 

He His Val Ala Ala Asp Gly Thr Val Asn Ala Glu Ala Phe Ser 

50 55 60 

Ala Leu Val Ser Leu Asp Ala. Ala Ser Ser Arg Pro Asn Leu Leu 

65 70 75 

Asp Ala Asp Gly Ser Ala Leu Val He His Ala Ser Pro Asp Asp 

80 85 90 

His Val Thr Gin Pro He Gly Gly Ala Gly Ala Arg Val Ala Cys 

95 100 105 

Ala Val He Arg 



<210> 12 

<211> 222 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7485135CD1 

<400> 12 

Met Ala Glu Lys Pro Lys Leu His Tyr Ser Asn Ala Gin Gly Arg 
15 10 15 

Met Glu Ser Thr Trp Trp Leu Leu Ala Ala Ala Gly Val Glu Phe 
20 25 30 

Glu Glu Lys Phe Leu Glu Ser Ala Glu Asp Leu Glu Lys Leu Arg 
35 40 45 

Asn Asp Gly Ser Phe Met Phe Gin Gin Val Pro Met Val Glu He 
50 55 60 

Asp Gly Met Lys Leu Val Gin Thr Arg Ala He Leu Asn Tyr Val 
65 70 75 

Ala Ser Lys Tyr Asn Leu Tyr Arg Lys Asp He Lys Glu Arg Val 
80 85 90 

Leu He Asp Met Tyr Thr Glu Gly He Ala Asp Leu Gly Glu Met 
95 100 105 

He Leu Pro Leu Pro He Cys Gin Pro Glu Asp Lys Asp Thr Lys 
110 115 120 

Leu Ala Leu Asn Gin Gin Arg Thr Lys Asn Arg Tyr Phe Pro Ala 
125 130 135 

Phe Glu Lys Val Leu Lys Ser Asn Gly Gin Asp Tyr Leu Val Gly 
140 145 150 

Asn Lys Leu Ser Trp Ala Asp He His Leu Val Glu Leu Leu Tyr 
155 160 165 

Tyr Met Glu Glu Leu Asp Ser Ser Leu He Phe Ser Phe Pro Leu 
170 175 180 

Leu Lys Ala Leu Lys Thr Arg He Ser Asn Leu Pro Met Val Lys 
185 190 195 
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Lys Phe Leu Gin Pro Gly Ser Pro Arg Lys Ser Leu Met Asp Glu 
200 205 210 

Lys Ser Leu Glu Glu Ala Arg Lys lie Phe Arg Phe 
215 220 

<210> 13 

<211> 273 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7684978CD1 

<400> 13 



Met 


Arg 


Tyr Val Lys Leu 


Ala 


Gly Cys Ser 


Val Pro Ala He Gly 


1 




5 




10 


15 


Gin 


Gly 


Thr Arg Tyr Met 


Gly 


Glu Asp Pro 


Ala Arg Lys Ala Ala 






20 




25 


30 


Glu 


Val 


Ala Ala Leu Gin 


Tyr 


Gly He Glu 


Gin Gly Met Thr Leu 






35 




40 


45 


lie 


Asp 


Ser Ala Glu Met 


Tyr 


Ala Glu Gly 


Gly Ala Glu Lys Val 






50 




55 


60 


Val 


Gly 


Gin Ala lie Ala 


Gly 


Arg Arg Asp 


Gin Val Phe Leu Val 






65 




70 


75 


Ser 


Lys 


Val Tyr Pro His 


Asn 


Ala Ser Arg 


Arg Gly Val Pro Glu 






80 




85 


90 


Ala 


Cys 


Glu Arg Ser Leu 


Lys 


Arg Leu Gly 


Thr Glu Val He Asp 






95 




100 


105 


Leu 


Tyr 


Leu Leu His Trp 


Arg 


Gly Gin Tyr 


Pro Leii Glu Glu Thr 






110 




115 


120 


Val 


Glu 


Ala Phe Glu Arg 


Leu 


Arg Glu Gin 


Gly Lys He Arg His 






125 




130 


135 


Trp 


Gly 


Val Ser Asn Phe 


Asp 


Val Asp Asp 


Leu Arg Glu Leu His 






140 




145 


150 


Asn 


Pro 


Asp Cys Ala Thr 


Asn 


Gin Val Leu 


Tyr Asn Pro Ala Gin 






155 




160 


165 


Arg 


Gly 


lie Glu Phe Asp 


Leu 


Leu Pro Trp 


Ser Arg Gin Arg Gly 






170 




175 


180 


Leu 


Pro 


Thr Met Ala Tyr 


Cys 


Pro Leu Ala 


Gin Ala Gly 61n Leu 






185 




190 


195. 


Leu 


Gin 


His Pro V£a Leu 


Ser 


Glu He Ala 


Glu Arg His Gly Ala 






200 




205 


210 


Arg 


Pro 


Ala Gin Val Ser 


Leu 


Ala Trp Val 


Thr Arg Asp Asn Asp 






215 




220 


225 


Val 


lie 


Ala He Pro Lys 


Ala 


Val Ser Ser 


Glu His Val Arg Leu 






230 




235 


240 


Asn 


Ala 


Ala Ala Gly Ala 


Leu 


Thr Leu Thr 


Ser Glu Asp Leu Arg 






245 




250 


255 


Ala 


lie 


Asp Arg Ala Phe 


Pro 


Ala Pro Thr 


Arg Lys Gin His Leu 






260 




265 


270 


Ala 


Met 


Val 









<210> 14 
<211> 2004 



18/29 



wo 02/46426 ^ PCT/USOl/47429 

<212> DMA 

<213> Homo sapiens 



<220> 

<221> mlsc^feature 

<223> Incyte ID No: 7483477CB1 



<400> 14 

catacactcc acctggggac agaaggaaaa 
aataaaacaa catgttttcc ctatctgaat 
gtgactttcc cagtgaagaa aagacagagc 
aaaatcaata gcctaaaaag ataataattt 
caattctgtt gagctgagag agaagaacag 
gccccagagg gtttccctgt gtgaattcaa 
ttatataacc acccccaata taaagcaatc 
cttgaatccc agggtgttca gattttggta 
ggatctggtg cccaggacag aacctgttga 
accatctttc tggcactccg cttgtcttgc 
gataaggcag gaaagctgcc cccaggtcct 
caagttcgaa ctgacgctac gtttcagtct 
gtgttcactg tgtacatggg tccccggcca 
aaggaagcct tgatagacca agcggatgag 
aagcaaaact tccaaggtca tggtgtagct 
cgacgcttct ccctgaccat ccttcgggac 
cgaattcagg aggaggccag ctacctactg 
atcgacccca tcttcttgct gagccgcacc 
agaagtcgct tcgactatga ggacaagcag 
agtttcattg agatgagcac tccctgggca 
cagtatttgc caggaagaca caatctcatc 
attgcctcca gggtcaagat caacgaagca 
attgactgct ttctcatcaa gatgcaccag 
ctcaagaact tggtcctcac cactctgaac 
tctacactgc gctatggatt cttgctgtta 
catgaagaga ttaaccaagt gattggacca 
aagatgccct acacagatgc cgtcatccac 
^tgggtgtcc cccacaacct catccgggac 
ggcacagatg tatttcccct gcttggctct 
ccagatgcct tctatcccca gcactttctg 
gcctttgtgc ctttttcctc tggaaagcgc 
gaactctttc tctacttcac ctccaccctt 
cctgtggaca tcgatatcac ccccaagctc 
gagctctgtc ttgtggcccg ctga 



tctggtctca aaaaataaaa cattaacaaa 60 
agatggggga aacaagtccc cagaggttaa 120 
tgccacttgt acccagattg tctggatcct 180 
gcctcttaca ccccaccaag aaattctgtt 240 
ggagactgga aatatgaaga aaggggaatg 300 
tttaccttta ttcaggtggg agattatagg 360 
tcttccccga tgacactcaa acccggaaaa 420 
taagctgatc tcccaggacc tacttaggca 480 
cctggcacca tggagctggg aggggctgtc . 540 
ctgctcatcc tcattgcctg gaaacgaatg 600 
acaccaattc ctttcctggg gaacctgctg 660 
ttcatgaagc tcagggagaa atacagccct 720 
gtggtggttt tatgtggaca cgaagcagtg 780 
ttcagtggcc gtggagaact ggcttcaata 840 
ctggccaatg gagaacgatg gaggattctc 900 
ttcgggatgg gaaagcagag cattaaggag 960 
gaggagttcc agaagaccaa gggtgccccc 1020 
gtttccciacg tcatcagttc cgttgtcttc 1080 
ttcctgaatc tgctgcggtt gatcaatgag 1140 
cagctatatg acatgtactc tggaatcatg 1200 
tactacttgg tagaagagct caaggacttc 1260 
tcctttgacc cccaaaaccc tcgggacttc 1320 
gataaaaata atcctcgtac agaattcaac 1380 
ctcttctttg ctggcacgga gacggtgagc 1440 
atgaagcatc ctgaagtgga agctaagatc 1500 
caccggctcc caagggtgga tgaccgggtc 1560 
gaaatacaga gactggtaga catcgtgccc 1620 
actcagtttc gaggctacct: tctgcccaag 1680 
gtcctcaaag accccaaata cttccgctac 1740 
gatgagcagg gccgcttcaa gaagaatgaa 1800 
atctgcctgg gcgaggccat ggaccgcatg 1860 
cagaacttct ctctacactc gctggtgccg 1920 
tcaggctttg gcaacatccc cccgacctac 1980 

2004 



<210> 15 
<211> 1191 
<212> 

<213> Homo sapiens 
<220> 

<221> misc_£eature 

<223> Incyte ID No: 7485159CB1 



<400> 15 

aaagccccgg ctgccggcgg cggaccacct 

gccctagtgc tttccagcgg atttcccctc 
gcgatgtctc cccaggttaa attaccctag 



ctgctgccgc gcgcctaccg gagccgcttg 60 
agctcctact ctcgggcttc caaatctggg 120 
ctcctgctcc agatcgcttc cccgtgcccc 180 
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gccagagccc agtagttcaa aaattaaatt 
tgtttctgga gcctgcggca gcggtggcga 
gctgcgcaaa ccacctggag accatgtctg 
gccagccccc agggccagtc ccggaggggc 
cctattctca caggacccgc ctcgtcctca 
acattaacct gagaaacaag cctgaatggt 
ctgtcctgga gaccagccEia tgtcaactga 
tggatgatgc ttatccagga aggaagctgt 
aaaagatgtt attggagcta ttttgtaagg 
cgttgagatg tgggagagaa tgcactaatc 
acctggaaga gattcttgag tatcagaaca 
tgattgatta cctcctctgg ccctggtttg 
gtgtgagcca cacgccagcc ctgcggctct 
tctgtgctct tctcatggat aagagcattt 
acaaccctaa tgcctttgac tttgggctgt 
ccagaattCG ccagcttgtt gggagtctac 
tctctttctt ttctttgaag ttcccaataa 



tggggcaagg ggtgcgcgcc agagcgcagc 240 
gccacagggc ggcgaccgtg agctccggga 300 
gggatgcgac caggaccctg gggaaaggaa 360 
tgatccgcat ctacagcatg aggttctgcc 420 
aggccaaaga catcagacat gaagtggtca 480 
actatacaaa gcaccctttt ggccacattc 540 
tctatgaatc tgttattgct tgtgagtacc 600 
ttccatatga cccttatgaa cgagctcgcc 660 
tcccacattt gaccaaggag tgcctggtag 720 
tgaaggcagc cctgcgtcag gaattcagca 780 
ccaccttctt tggtggaacc tgtatatcca 840 
agcggctgga tgtgtatggg atactggact 900 
ggatatcagc catgaagtgg gaccccacag 960 
tccagggctt cttgaatctc tattttcaga 1020 
gctgagtctc actgtccacc ccttcgctgt 1080 
gtcacggctt gtcttgggaa ccaatccgtc 1140 
aatgaaaaca ggaaasi a aaa a 1191 



<210> 16 

<211> 3419 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> inisc_f eature 

<223> Incyte ID No: 7485518CB1 



<400> 16 

tttgggcgta tccgttcccc ttaccccttt 
gatcccaatc cctattggga attagagtta 
ctccagaaag cccgatacac taacaagagg 
actggtattc cgggtgatta aatagttctc 
gtgtaattct cgtgtccaag aaggaactgc 
atcattccag ctgctgcttg ttttgaaaag 
tggaagagtt cagacactaa gactggagct 
gattgcatga actgaaagct cct:tct:aat:t 
atatctcatg ttctaacacg ggataaattc 
tctctagatt atgcttagag gattctgcag 
ctcaacgtat atccttccca tctattgcct 
aacattttac atgatttgcg atggttacat 
tcatacttta ttcactatgg cactttagga 
tattctaaaa aatgaatctt attgtgaaac 
tcttagctac cttttgcttg gtgagcattg 
acaaacagga aatgacactt attgaaacca 
ttctaccata taggtcaatg gagctgaaaa 
accctactgt ccttctcttc gtggagagcc 
ctattttgga gtccagccga tttcagtacc 
tacctcctct tacagataat ggcaaaggga 
tgaagtatgt cagcatggac tcatggaatc 
acagtgttag tatciatcggt tttcataaag 
taaaaggctt tccgttaciac cttttcaata 
ctcaatctcc tttgctgcat attaccaaag 
gggaagactg gactattttc caiatataatc 
agttacagac agaaaaatcc ctgtcatcct 
ttcaggatct ggggcttcat gatggaattc 
tttggctgca caagctcatc ttcatagatg 



tacccatgga gccttaactc gcgcccggtg 60 
gactcccatc aggctactta ctcgattcgg 120 
cacaccacca ctaccacagc agacaggacc 180 
ataccttttg ctgtactcgt ctgctccctg 240 
attttaacgg tgtgctgaac agaggtttga 300 
gacatgtgtg tataaataaa aggactctgc 360 
agagagtgct cttggaatat tgttgaggca 420 
aacctggagc caagtgaacc tgaatiactigg 480 
aagttiagaaa aagacaaata ttgaaatgct 540 
cattctcagt attggacata. ttcaaatctc 600 
tccagcttta actcagttgt gctcccttca 660 
gaactctggg gaagtgatat ttgcaacatc 720 
aacgaaattg ggaaaaagct tccaaaacat 780 
ttcggagaag ttttcgaaca ttgattgttc 840 
tcatttctgc ctattttctc tactctggct 900 
ctgcagaagc agaatgcact: gacatcaaaa 960 
cagttaaacc tattgacaca tccaaaacgg 1020 
aatactctca actcggtcaa gatatcatag 1080 
acatggttat tgcccctgga aagggagata 1140 
aatatacttt agttatttat gaaaatattc 1200 
gagagctttt agaaaaatac tgtgtggaat 1260 
ccaatgagaa cagcttacca agtacacaat 1320 
atctagctct aaaggactgt tttgttciacc 1380 
cccccaaggt tgagaaaggc cctcttcctg 1440 
attcaaccta ccagcctgta cttttaactg 1500 
tgtctagcaa aacactcttt gcaacggtga 1560 
agagagtact ttttggcaac aacttgaact 1620 
ccatctcctt cttgtcaggg aagaggctga 1680 
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cattgtcctt ggacaggtac atccttgtgg acattgatga tatatttgtt gggaaagagg 1740 
gaacaaggat gaatgtcaaa gatgtgaagg taagcttatt tgagactcaa aatttactgc 1800 
gcactcaggt tgcaaatttt accttcaacc ttggattttc agggaagttt taccacacag 1860 
ggactgaaga ggaagatgaa ggagatgacc ttttacttcg gtctgtggat gagttctggt 1920 
ggtttcctca catgtggagc cacatgcagc cccacctctt ccacaacgag tcatctttag 1980 
tagagcagat gattctcaac aaggaatttg cactggaaca tggaatacca atcaacatgg 2040 
gctatgctgt ggccccacat cactcagggg tctacccggt tcacattcag ctgtatgcag 2100 
cttggaagaa ggtctggggt attcaagtca ccagcactga agaatatcca catctgaaac 2160 
ctgcccggta cagaaagggc ttcattcaca atagcatcat ggtcctccct cgacagactt 2220 
gtgggttgtt cactcacact attttctaca aagaatatcc aggaggaccc caagaactgg 2280 
ataaaagtat cagaggaggt gsiactttttc tcacaatcct tctaaaccca atcagcattt 2340 
tcatgaccca tttatcaaac tatggaaatg accgcctagg gttatatacc tttgtgaact 2400 
tggtcaactt tgtgcagagc tggaccaacc tgaaattgca aactctgcct ccagtgcaac 2460 
tggcccacca gtattttgag ctcttccctg agcagaaaga ccctctatgg cagaatccat 2520 
gtgatgataa acgacacaaa gacatctggt ccagagagaa aacttgtgac cacttaccaa 2580 
siatttctagt aattggtcca caaaaaacag gaacaactgc actttattta tttcttctta 2640 
tgcatccttc aatcatcagc aatctcccta gtccaaagac atttgaggaa gttcagttct 2700 
ttaatggcaa caactatcac aaaggaatag actggtatat ggactttttc cccatccctt 2760 
ccaacaccac ctccgacttc tactttgaga aaagcgccaa ctactttgat tcagaagtgg 2820 
cgccccggcg ggcagcagcc ctcttgccca aagccaaggt cctgaccatc ctcatcaacc 2880 
ccgcggaccg ggcctattcc tggtaccagc accagcgagc ccatgacgac ccagtggccc 2940 
taaagtacac cttccatgag gtgattaccg ccggctctga cgcatcctcg aagctgcgtg 3000 
ccctccagaa ccgctgcctg gtccctggct ggtacgccac ccacatcgag cgctggctca 3060 
gtgcctatca cgccaaccag gtactggtct tggatggcaa actgcttcgc acagaacctg 3120 
ccaaagtgat ggacatggtg cagaagttcc ttggggtgac caacaccatt gactaccaca 3180 
asiaccttggc gtttgatcca aagaaaggat tttggtgcca actgcttgaa ggaggaaaaa. 3240 
ccaagtgtct gggcaaaagc aagggccgga aatatcccga gatggacttg cagtcccgag 3300 
ccttcctgaa ggactattac cgggaccaca acatcgagct ctccaagctg ctgtataaga 3360 
tgggccagac acttcccact tggttacgag aggacttcca gaacaccagg tagccgtgg 3419 

<210> 17 

<211> 2745 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 2860635CB1 

<400> 17 

atgtgggggc gcacggcgcg gcggcgctgc ccgcgggaac tgcggcgcgg ccgggaggcg 60 
ctgttggtgc tcctggcgct actggcgttg gccgggctgg gctcggtgct gcgggcgcag 120 
cgtggggccg gggccggggc tgccgagccg ggacccccgc gcaccccgcg ccccgggcgg 180 
cgcgagccgg tcatgccgcg gccgccggtg ccggcgaacg cgctgggcgc gcggggcgag 240 
gcggtgcggc tgcagctgca gggcgaggag ctgcggctgc aggaggagag cgtgcggctg 300 
caccagatta acatctacct cagcgaccgc atctcactgc accgccgcct gcccgagcgc 360 
tggaacccgc tgtgcaaaga gaagaaatat gattatgata atttgcccag gacatctgtt 420 
atcatagcat tttataatga agcctggtca actctccttc ggacagttta cagtgtcctt 480 
gagacatccc cggatatcct gctagaagaa gtgatccttg tagatgacta cagtgataga 540 
gagcacctga aggagcgctt ggccaatgag ctttcgggac tgcccaaggt gcgcctgatc 600 
cgcgccaaca agagagaggg cctggtgcga gcccggctgc tgggggcgtc tgcggcgagg 660 
ggcgatgttc tgaccttcct ggactgtcac tgtgagtgcc acgaagggtg gctggagccg 720 
ctgctgcaga ggatccatga agaggagtcg gcagtggtgt gcccggtgat tgatgtgatc 780 
gactggaaca ccttcgaata cctggggaac tccggggagc cccagatcgg cggtttcgac 840 
tggaggctgg tgttcacgtg gcacacagtt cctgagaggg agaggatacg gatgcaatcc 900 
cccgtcgatg tcatcaggtc tccaacaatg gctggtgggc tgtttgctgt gagtaagaaa 960 
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tattttgaat atctggggtc ttatgataca ggaatggaiag tttggggagg agaaaacctc 1020 
gaattttcct ttaggatctg gcagtgtggt ggggttctgg aaacacaccc atgttcccat 1080 
gttggccatg ttttccccaa gcaagctccc tactcccgca acaaggctct ggccaacagt 1140 
gttcgtgcag ctgaagtatg gatggatgaa tttaaagagc tctactacca tcgcaacccc 1200 
cgtgcccgct tggaaccttt tggggatgtg acagagagga agcagctccg ggacaagctc 1260 
cagtgtaaag acttcaagtg gttcttggag actgtgtatc cagaactgca tgtgcctgag 1320 
gacaggcctg gcttcttcgg gatgctccag aacciaaggac taacagacta ctgctttgac 1380 
tataaccctc ccgatgaaaa ccagattgtg ggacaccagg tcattctgta cctctgtcat 1440 
gggatgggcc agaatcagtt tttcgagtac acgtcccaga aagaaatacg ctataacacc 1500 
caccagcctg agggctgcat tgctgtggaa gcaggaatgg atacccttat catgcatctc 1560 
tgcgaagaaa ctgccccaga gaatcagaag ttcatcttgc aggaggatgg atctttattt 1620 
cacgaacagt ccaagaaatg tgtccaggct gcgaggaagg agtcgagtga cagtttcgtt 1680 
ccactcttac gagactgcac Cc^ctcggat catcagaaat ggttcttcaa agagcgcatg 1740 
ttatgaagcc tcgtgtatca aggagcccat cgaaggagac tgtggagcca ggactctgcc 1800 
caacaaagac ttagctaagc agtgaccaga acccaccaaa aactaggctg cattgctttg 1860 
aagaggcciat cattttgcca tttgtgaaag ttgtgttgga tttagtaaaa atgtgaataa 1920 
gctttgtact tattttgaga actttttaaa tgttccaaaa taccctattt tcaaagggta 1980 
atcgtaagat gttaaccctt ggtatttaga aaattaaaac cttataatat ttttctatca 2040 
agatgtatat tttacagtcg tgccttttac tctcattagc aaaaaagata aagattttat 2100 
tttggtattt acaagaattc ccaggtacga agatatctgc atgggtggaa atcaggttca 2160 
agcaacgtac tttgcattaa ctgataatac ctcagctgcg gggttaaagt tttcccagta 2220 
tagagagact gtcactagga acattgtatt gatttattca ggtcattgag atcttctaga 2280 
tgtattttaa aaagaatgct ttttggttat gtgttgctac cacagttaac actccataat 2340 
gttcatgtca gccaaagagg actaaccaaa gctgaaatct cagagaacaa tttgctttac 2400 
taagctgagt caacttgaga gcgaacttct aacaatgccg cactgtagtg tggctggttc 2460 
taccactatg actttaaaac atgtttatat catttttaat ttttatgata cggtagtgtc 2520 
agggagaaat gtaatgttct atatgaaatt cctttttcaa gtttgttcat taataacagt 2580 
tattaattta aatcagcgtt agagtttgtg ctgctgcaac tgctgtgaaa atttctctga 2640 
gtaattctga tttgtgaatg atcccagacc aaccctgaga ttttgtcaac ctgattaagt 2700 
CEiatatgaat gattaaaaag atgtgagsLac actgeiaaaaa aaaaa 2745 

<210> 18 

<211> 4763 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 2530615CB1 

<400> 18 

gagagaaata gactcctctt ctgtcccttc taacccaggt ccctacccgt acccctcctc 60 
cttcctttcc ccgccccctc ctccctcctg gggcgagggg ggcctccctc cctctccccc 120 
ccttctctct ctccgagggg ggggtcaggg gagggagggg gggtccccca atcagcatgt 180 
ggctcctggc gttgtgtctg gtggggctgg ctggggctca acggggagga gggggtcccg 240 
gcggcggcgc cccgggcggc cccggcctgg gcctcggcag cctcggcgag gagcgcttcc 300 
cggtggtgaa cacggcctac gggcgagtgc gcggtgtgcg gcgcgagctc aacaacgaga 360 
tcctgggccc cgtcgtgcag ttcttgggcg tgccctacgc cacgccgccc ctgggcgccc 420 
gccgcttcca gccgcctgag gcgcccgcct cgtggcccgg cgtgcgcsLac gccaccaccc 480 
tgccgcccgc ctgcccgcag aacctgcacg gggcgctgcc cgccatcatg ctgcctgtgt 540 
ggttcaccga caacttggag gcggccgcca cctacgtgca gaaccagagc gaggactgcc 600 
tgtacctcaa cctctacgtg cccaccgagg acggtccgct cac aei a aa aa cgtgacgagg 660 
cgacgctcaa tccgccagac acagatatcc gtgaccctgg gaagaagcct gtgatgctgt 720 
ttctccatgg cggctcctac atggagggga ccggaaacat gttcgatggc tcagtcctgg 780 
ctgcctatgg caacgtcatt gtagccacgc tcaactaccg tcttggggtg ctcggttttc 840 
tcagcaccgg ggaccaggct gcaaaaggca actatgggct cctggaccag atccaggccc 900 
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tgcgctggct cagtgaaaac atcgcccact 
ttggttccgg ggcaggggcc tcctgcgtca 
ggctgttcca gaaggccatc gcccagagtg 
accagccgct caagtacacg cggctgctgg 
gtgctgaagc tgtggagtgt ctgcgccgga 
tgcagcctgc ccgctaccac atcgcctttg 
atgaccctga gatcctcatg cagcagggag 
tcaaccaggg agagggcctc aagttcgtgg 
ctgccagcgc ctttgacttc actgtctcca 
aaggcaagga tgtgcttcgg gagaccatca 
acaatggcga aatgcgccgc eiaaaccctgc 
caccagctgt ggccactgcc aagctgcacg 
ccttctacca ccactgccag gcggagggcc 
atgaactgcc ctatgtcttt ggcgtgccca 
acttctccaa gaatgacgtc atgctcagtg 
ccaagactgg ggaccccaac cagccggtgc 
ccaatcgctt cgaggaggtg gtgtggagca 
acataggcct gaagccacgc gtgcgtgaca 
tggagctcgt gccccacctg cacaacctgc 
tgcctcccta cgccacgcgc tggccgcctc 
ggcccccgcc gcctgccacc ctgcctcccg 
atgaccgctt ccccggggac tcacgggact 
tgggtgcctc cctcctcttc ctcaacatcc 
accggcggca ggagctgcgg tgcaggcggc 
tgcctggtgg gggccccctg ctccccgccg 
tggtgtcact gcagctgaag cggggtggtg 
gccctgcctg cccgcccgac tacaccctgg 
tcttggcccc cggggccctg accctgctgc 
cgcccccctc ccttcatccc ttcgggccct 
acaacaacac gctaccccac ccccactcca 
ccctcctccc cggccctccc tggcccggcc 
ctggcttttc tcctgtggag tcgtcacacg 
ttcctccctg cgatgcgtgt ctttcccacg 
ggcctttgaa caactggggg gcgttttctc 
tggaatgtgg tattttcccg cgtggaggtg 
tgtgcagggt gaggtttttt tttgccaccc 
ctgtggggat ttgtacccca gaatcctgtt 
tccctccccc tggagaccct ggaagtggtg 
accacagagg atggagcctg ggaagcagcg 
ccttgcccct accccggcga agcatgttcc 
gaagcacgtt ctgccgggga ggccctcacc 
tgggggaggg aggagtccac gcatcctgat 
ggatgcattt ctctgagtgg aaacaggttc 
acggcccctc tctt.cccagc acttccctgc 
tcctcctcac atggcaggtg agcacagact 
aaaccccgag ggaggcccgg cccttgctcc 
ccccccgcag aggccacgca tgtttgacca 
ttccccaggc ctcagagcat tgctcatccg 
aagactccca aaatgtgcca agaatttccc 
caagcaggat acagggcgag ggatgtggca 
cctcaccatg tctcccccac cctgcctcag 
ctttgaagct gtccccatct cagtgtcaga 
ctgaccgacg ccccctcctt gtctgaaaga 
ggggagaaag gtctcaccgg acaggttggg 
tggagggggc agtggggaca gggcttgggc 
gaggtgactc cccggagggc cttgggcttg 



ttgggggcga ccccgagcgt atcaccatct 9 60 
accttctgat cctctcccac cattcagaag 1020 
gcaccgccat ttccagctgg tctgtcaact 1080 
cagccaaggt gggctgtgac cgagaggaca 1140 
agccctcccg ggagctggtg gaccaggacg 1200 
ggcccgtggt ggatggcgac gtggtccccg 1260 
aattcctcaa ctacgacatg ctcatcggcg 1320 
aggactctgc agagagcgag gacggtgtgt 1380 
actttgtgga caacctgtat ggctacccgg 1440 
agtttatgta cacagactgg gccgaccggg 1500 
tggcgctctt tactgaccac caatgggtgg 1560 
ccgactacca gtctcccgtc tacttttaca 1620 
ggcctgagtg ggcagatgcg gcgcacgggg 1680 
tggtgggtgc caccgacctc ttcccctgta 1740 
ccgtggtcat gacctactgg accaacttcg 1800 
cgcaggatac caagttcatc cacaccaagc 1860 
aattcaacag caaggagaag cagtatctgc 1920 
actaccgcgc caacaaggtg gccttctggc 1980 
acacggagct cttcaccacc accacgcgcc 2040 
gtccccccgc tggcgccccg ggcacacgcc 2100 
agcccgagcc cgagcccggc ccaagggcct 2160 
actccacgga gctgagcgtc accgtggccg 2220 
tggcctttgc tgccctctac tacaagcggg 2280 
ttagcccacc tggcggctca ggctctggcg 2340 
cgggccgtga gctgccacca gaggaggagc 2400 
gcgtcggggc ggaccctgcc gaggctctgc 2460 
ccctgcgccg ggcaccggac gatgtgcctc 2520 
ccagtggcct ggggccaccg ccacccccac 2580 
tccccccgcc ccctcccacc gctaccagcc 2640 
ccactcgggt atagggggtg ggtggggagg 2700 
actccgaagg cagggaggag gacttggcaa 2760 
ccatccagca gcgctaaggt ggacatggga 2820 
cagagaagcc cagtctcttc tctggatctg 2880 
ccccccattg ggacaccagt cttcggtgtg 2940 
tgctttctca caacggggtg tgttttccca 3000 
tggacacatg ttggccccct caiaagaattt 3060 
cccccatccc ttctcccacc tcctcccctc 3120 
tgttcacata cagtgaccct tggccaccag 3180 
aggaaatcac agccccctcg cccctgcctc 3240 
ccccgacgcc ccccttggca caagt:cagat 3300 
ttccagagag gacagacaca gatttcctgc 3360 
gctgcctgga agcttatttt cccgtggcca 3420 
ttgcat^gtgg atgtgtgttt ccccaggcag 3480 
ctcccccagg cctcaggccc agcacccagt 3540 
tctagttggc aggagctgag gagggtgaac 3600 
cgagttgggg ggagggggtg tggcaacgtg 3660 
aagccctcat tgtggtccga ggacagcctt 3720 
tgccaaactg ggtaggtgga tttgagcgga 3780 
agtcccaggc agggcagggg aaactaaggg 3840 
ggtgaggggg ctcccgcctg tgccccttct 3900 
ttctccgttc cccttcatct ccgtccccct 3960 
ccagccttct cctcatctga ccaccctcct 4020 
aaggagcctt gaatggtgga gggaggcagt 4080 
agaatgaggt cagcggtgct ggggaacaga 4140 
agacaccagc aggaataatt tgaaatgtgt 4200 
ggcatttggg aaaagaatga tgtctggaag 4260 
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ggcttaaggg acacagtgga cgaggggaga 
tagtgccaaa cttgaatagg ggctggggtg 
ccctggtctt gagtccccag aactttgcct 
ccatggaaaa ttagttattt tctgatcctt 
agccatgccc tccaaatgct agagacctgg 
aattggggca tgggaggggg gctgggggac 
agctcctctc cccaaaacaa tcccgacaat 
ctgtacacat ttttaaacct ggcaziaagat 
ctgtteiaeiaa aaaaaaaaaa aaa 

<210> 19 

<211> 2970 

<212> DMA 

<213> Homo sapiens 



gtcctcatct gctggcattt tgtggggtgt 4320 
ctgtcttcca ctgacaccca aatccagaat 4380 
cttgactgtc ccttctcttc ctacctccat 4440 
tcccctgcct ggtctagctc ctctccaaac 4500 
gccctgaacc ctgtagacag atgccctcag 4560 
cccatgattc agccacggac tccaatgccc 4620 
cccttatccc taccccaacc ctttgcggct 4680 
gsiagagaata ttgtaaatat aaaagtttaa 4740 

4763 



<220> 

<221> misc^feature 

<223> Incyte ID No: 3883906CB1 



<400> 19 

cagctaggcg cgagcgaaaa caaacagctg 
gagcacgccg gcccagtccc ccacctgggg 
cgccaatgcc atcgccatcc tgaccgtagc 
gcaggacaac cgagcccacg cagcatcctc 
gaggtcggag cagctccgcg aggaccgcac 
gaaaggcttt gatgagaagg cctacctgtc 
ctacagacag cacgccttca accagctgga 
ccgggacacc cgccattaca gctgcccatc 
cagcgtcatc atcaccttcc acaatgaggc 
tgtcctgaac cgaactcctg ccaacttgat 
ctcagatccg gaagactgtc tactcctgac 
tgatcggcgg gaagggctga tccggtcccg 
cgttctcacc tttctggata gccactgcga 
gcagcgggtg aaggaggacc acacccgcgt 
ggataatttt gcctaccttg cagcatctgc 
gcatttcaag tgggagcaga tccctcttga 
gcccataagg acgcctgtca tagctggagg 
ccacttggga aagtatgatg cccagatgga 
cttcagggtg tggatgtgtg gtggcagtct 
tgtcttcagg aaacggcacc cctacsuactt 
gaatact.aag cgcactgcag aagtgtggat 
ccggccctcg gccatcggga aggccttcgg 
gaagatigaac tgcaagtcct tccgctiggta 
ccccgtgaag gaagcactcc ccggcatxrat: 
gggccagaac acagctggtg acttcctgct 
gaacccgcag cccgcccagg catggctgtt 
gtgcctggct gccacctcca ccttaatgtc 
gtgcaaccct agagaaggca agcagaaatg 
agtcagtggc ctctgcctgg agacaaagcc 
tgacgcccag gcccagcagt ggcagctgtt 
agggtcagct attgcactgt gctgggaggc 
cccagttcct tttcacagcc gctgagaaca 
tggaactgct cctgctatcc ccacaccctg 
gactgttggt accaggaccc tgggggtctc 
atacctacca caggagcccc ctgtctttca 
catggattct agagggtcta ggaatctctt 



gggctgcgag cgcccccgcc ccggccccga 60 
cgcccgtctg cccaccatga ggaagatccg 120 
ctggatcctg ggcactttct actacttatg 180 
cggcggccgg ggcgcgcaga gggcaggcag 240 
catcccgctc attgtgacag gaactccctc 300 
ggccaagcag ctgaaggctg gagaggaccc 360 
gagtgacaag ctgagcccag accggcccat 420 
tgtgtcctac tcctcggacc tgccagccac 480 
ccgttccacc ctgctgcgca cagtgaagag 540 
ccaggagatc attttagtgg atgacttcag 600 
caggatcccc aaggtcaagt gcctgcgcaa 660 
agtgcgtggg gcggacgtgg ctgcagctac 720 
agtgaacacc gagtggctgc cgcccatgct 780 
ggtgagtccc atcattgatg tcatcagtct 840 
tgaccttcgt ggagggttcg actggagcct 900 
gcagaagatg acccggacag accccaccag 960 
aatcttcgtg atcgacaagt cctggtttaa 1020 
catctggggg ggagagaatt ttgagctctc 1080 
ggagatcgtc ccctgcagcc gggtgggcca 1140 
ccctgagggt aatgccctca cctacatcag 1200 
ggatgaatac aagcaatact actatgaggc 1260 
cagtgtggct acgcggatag agcagaggaa 1320 
cctggagaac gtctacccag agctcacggt 1380 
taagcagggg gtgaactgct tagaatctca 1440 
tggaatgggg atctgcagag ggtctgccaa 1500 
cagtgaccac ctcatccagc agcaggggaa 1560 
ctcccctgga tccccagtca tactgcagat 1620 
gaggagaaaa ggatctttca tccagcattc 1680 
tgcccagctg gtgaccagca agtgtcaggc 1740 
gccacacaca tgacggaggg attttcttga 1800 
tggctgtggc tcctgtttgt gaagagaact 1860 
catccacaca tctaccccac ggccttgtct 1920 
cttcctcacc ccactggctc ttgggcaaat 1980 
ccctacaaag cagaccagcc tggggcagat 2040 
taggccaagg atccacatac ccatagagtt 2100 
gaaattgtcc ctaaaatact gcatgtgtgt 2160 
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PCTArSOl/47429 



gcatacatgc attttcctgg ggaaaaagtc catggcttct aaaagaggtc cctgatcccc 2220 
aaagggcctg aacctctgct ctggattgaa gccactgtct gcccaagctg ccacccccta 2280 
atcttcctcc ctggcgtgct caaactgcca tcgcctgctc cctccacacg gcccttgggg 2340 
tcagcagcca agtgtctgtg gtccgcagca cctgctctgg agggctctcc agcagtttcg 2400 
cctcctgact ctcaccagcg tcctctcggc agcactcccg tgcccaagtg cacccctctg 2460 
gacttgccag cgcaggcccc ttcgctccca gggccatgct ttgcctgtcc tctgtcgtga 2520 
tgtttcttcc gcagccaggt gcagcctcag atcccctgtt catctgggaa gcctcctgcc 2580 
acagcttagg acagaactgg gcccagggca aaggcctctc cccagaggat ggactagaag 2640 
gcctgggcca cactcgggca tggacctttg gggctgggga gcaggggctg cgcctgttga 2700 
atgtaagagg actgctgacc agagggcctt caagagggtc tctctgtcct ttgctgtggt 2760 
cagatcaggc tctgcactta tcagccggtc ctttgtggca acgcagccct gttctgtttt 2820 
tgcttttcct cttcttgacc aaagcatgtg ccactagctg tccttgagga cctcgtcttt 2880 
atgaaacaca cacctggaat aaaaccactt cttacatgtc caaaaa a a aa aa a aaaaaaa 2940 
aaaaaaciaeia aaaaaaaaaa aaaaaaaaaa 2970 

<210> 20 

<211> 834 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> inisc_feature 

<223> Incyte ID No: 7473644CB1 „ 
<400> 20 

atgtcgtcct gcagccgcgt ggccctggta actggggcta acaaaggcat cggctttgcg 60 
atcacgcgtg acctgtgtcg gaaattctcc ggggacgtgg tgctcacggc gcgggacgag 120 
gcgcggggcc gcgcggcggt gcagcagctg caggcggagg gcctgagccc acgcttccac 180 
cagctggaca tcgacgaccc gcagagcatc cgtgcgctgc gcgactttct gcgcaaggag 240 
tacgggggac ttaacgtgct ggtcaacaac gcgggcaticg cctttagaat ggatgaccca 300 
acacccttcg acattcaagc tgaggtgaca ctgaagacga acttttttgc cactagaaat 360 
gtctgcactg agttactgcc tataatgaaa ccacatggca gagtggtgaa tatatcaagc 420 
ctaataagtc tagaggccct gaaaaactgc agcctggagc tacagcagaa gtttcgaagt 480 
gagaccatca cagaggagga gctggtgggg ctcatgaaca agtttgtgga ggatacaaag 540 
aaaggagtcc atgcaaaaga aggctggcct aatagtgcat acggggtgtc taagattgga 600 
gtgacagtcc tgtccagaat ccttgccagg aaactcaatg agcagaggag aggggacaag 660 
atccttctga atgcctgctg ccctggctgg gtcagaaccg acatggcagg accacaagcc 720 
accaaaagcc cagaagaagg agcagagacc cctgtgtact tggccctttt gcctccagat 780 
gcagagggac ctcatgggca gtttgttcaa gataaaaaag tggaacaatg gtga 834 

<210> 21 

<211> 2466 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7485303CB1 

<400> 21 

atggtaaatg gcttgtttct cttctccctc tacacacccc aggccccgtc tatgagctct 60 
aaaaacaacc ttgaagcctt cagaaggcct gagcagcgag gccaccgggc cgcgcgctcc 120 
cagcttcgct cggacgcggc ttcggcccgc agagggttcg tggcccggac gcggcgagag 180 
ctgggcccag gacggtgcgt ccggcctcgc ccgcggctgc tcgcaccaac aagtttgaac 240 
aatgatcacc gtcaaccccg atgggaagat aatggtcaga agatgcctgg tcaccctgag 300 
accctttcgg ctttttgtcc tgggcatcgg cttcttcact ctctgcttcc tgatgacgtc 360 
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cccggaaagc acgaccgcta caaaggtgct 
gtcgtgttca ttaccgccac gggcatggat 
agacagcgac aacagaaaac tggcgaggca 
tggcgtagcg tgcatgagca ggtctctgat 
gcgtgcgcag gctttgttct gtggttgctg 
cgcgctcgtc ggcatcaatt gcactgcaaa 
ttgaatttgc ccaaggctcg cccatgacca 
ctgcgcagat cctgaccttt atcactgaac 
tgatcgccag cgtggtggac atcgagcgca 
gcctgatctg cttgcgcgac ggcttgccga 
cgacgtggct gggcagtaac tgcctgtacc 
gcggcggcgg ggcgggcaag aagttgttgc 
gttgcgggcg ctttgaatgg agcgtgctgg 
aatccatcgg cgcccagcct caggaagaat 
tgcgcgactt cgccctgggc tgaaacctct 
agcg 



gagcatcgcc gtggcaatga cactgcccgc 600 
cagcagtgac aacaaaaagg tcgagccgta 660 
gaacagcgcg aatgtcttcg tgcccacggg 720 
cgttggattt atgctcattg tttaccatgg 780 
atgcttgcat caccccgcgc cggagtgaaa 840 
cggccatcaa accgtcttag tatgtggcct 900 
tcgagattcg ccccgccgtg cccagcgatg 960 
ttgccgagta cgagaaagct cggcatgaag 1020 
gcctcttcag cgaaggcgcc acggcccatg 1080 
ttggctttgc ggtgttcttt ttcagttact 1140 
tcgaagacct gtacatcaac ccggagcaac 1200 
gccacttggc caagatcgcc tttgacaatg 1260 
actggaacga gccagcgatt gccttctaca 1320 
gggtacgtta tcgcatggaa ggcgatgccc 1380 
ttcgtatagt cactaaagat cgaatgtggg 1440 

1444 



<210> 23 

<211> 2232 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> mis cofeature 

<223> Incyte ID No: 6538080CB1 



<400> 23 

atggtggtgg tgcctttcac aattccattc 
aacagcccct ggagaatgac tgtagactac 
atagctgctg tgctaggtgt ggcatccctg 
gagaacaggg agagcctatc aaaggtctgt 
tgtgctacta atggggacat caaggtccaa 
aacaccaggc tgggatggat tcagggcaag 
gtgeiacgtgt tcctcggagt cccctttgct 
aacccgcagc ctgcatcgcc ctgggataac 
tgcctccaga actcagagtg gctgctctta 
aaattcggag tgtcagaaga ctgcctctac 
acaggctcca agctccccgt cttggtgtgg 
gcctccatct ttgatgggtc cgccctggct 
cagtaccggc taggaatatt tggtttcttc 
tgggccttca aggaccaggt ggctgctctg 
ggtggggacc ccagctctgt gaccatcttt 
agtcttatac t.gtctcccat ggccaaaggc 
gtggccatca t^cccttacct ggaggcccat 
gttgcacatt tctgtggtaa caatgcgtca 
acaaaaccct ccaaggagct gctgaccctc 
gttgatggtg ctttctttcc taatgagcct 
gcaattcctt ccatcatcgg agtcaataac 
gaggctcctg aggtcctcag tggctccaac 
atcctgcaca tcccgcctca gtatttgcac 
cactccctga ctgaaatccg agacagtctt 
gtccctgcac tigatcacagc tcgatatcac 
gagttt.cggc accggcctca gtgctttgaa 
cacgctgatg aagtccgctt tgtgttcggt 
ttcgaaggag ccacggagga ggagaagtta 
acctttgctc gaaccgggaa tcctaatggg 



gattcatcag tctggctgct gcagaagttg 60 
cacacactca accaagcggt aactctgatc 120 
cttgagcagg gtttcagcag agcaatctgg 180 
gtctcccagg ggccatccag gccactggcc 240 
ggagggcctt ctgctgaagg gccacagagg 300 
caagtcactg tgctgggaag ccctgtgcct 360 
gctcccccgc tgggatccct gcgatttacg 420 
ttgcgagaag ccacctccta ccctaatttg 480 
gatcaacaca tgctcaaggt gcattacccg 540 
ctgaacatct atgcgcctgc ccacgccgat 600 
ttcccaggag gtgccttcaa gactggctca 660 
gcctatgagg acgtgctggt tgtggtcgtc 720 
accacatggg atcagcatgc tccggggaac 780 
tcctgggtcc agaagaacat cgagttcttc 840 
ggcgagtccg cgggagccat aagtgtttct 900 
ttattccaca aagccatcat ggagagtggg 960 
gattatgaga agagtgagga cctgcaggtg 1020 
gactctgagg ccctgctigag gtgcctgagg 1080 
agccagaaaa caaagtcttt cactcgagtg 1140 
ctagatctat tgtctcagaa agcatttaaa 1200 
cacgagtgtg gcttcctgct gcctatgaag 1260 
aagtcccttg ccctccatct gatacaaaac 1320 
cttgtggcta atgaatactt ccatgacaag 1380 
ctggacttgc ttggagatgt gttctttgtg 1440 
agagatgctg gtgcacctgt ctacttctat 1500 
gacacgaagc cggcttttgt caaagccgac 1560 
ggtgccttcc tgaaggggga cattgttatg 1620 
ctgagccgga agatgatgaa atactgggct 1680 
aacgacctgt ttctgtggcc agcttataat 1740 
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tctgggaggc cagttctcgg cccggcgcct gggggactcg ccattcacca tccgcacaga 420 
aggatgcccc ctggggccgg cctcatggag cggatccagg ctattgccca gaacgtctcc 480 
gacatcgctg tgaaggtgga ccagatcctg cgccacagtc tgctcctgca cagcaaggtg 540 
tcagaaggcc ggcgggacca gtgtgaggca cccagtgacc ccaagttccc tgactgctca 600 
gggaaggtgg agtggatgcg tgcccgctgg acctctgacc cctgctacgc cttctttggg .660 
gtggacggca ccgagtgctc cttcctcatc tacctcagtg aggtcgagtg gttctgcccc 720 
ccgctgccct ggaggaacca gacggctgcc cagagggcac ccaagcccet ccccaaagtc 780 
caggcagttt tccgaagcaa cctgtcccac cttctggacc tgatgggcag cgggaaggag 840 
tccctgatct tcatgaagaa gcggaccaag aggctcacag cccagtgggc gctggctgcc 900 
cagcgcctgg cacagaagct gggggccacc cagagggacc agaagcagat cctggtccac 960 
atcggcttcc tgacggagga gtccggggac gtgttcagcc ctcgggtcct gaagggcggg 1020 
cccctagggg agatggtgca gtgggcggac attctgactg cactctatgt cctgggccat 1080 
ggcctgcggg tcacagtctc cc tgaaggag ctgcagagta acttaggggt accgccaggc 1140 
cggggaagct gcccgctcac catgcccctg cccttcgacc tcatctacac cgactaccac 1200 
ggcctgcagc agatgaagcg gcacatggga ctctccttca agaagtaccg gtgccgaatc 1260 
agggtcatcg acaccttcgg gacggaacct gcgtacaacc acgaggagta cgccacgctg 1320 
cacggctacc ggaccaactg gggctactgg aacctcaacc ccaagcagtt catgaccatg 1380 
tttcctcata cccccgacaa ctccttcatg ggcttcgtgt ccgaggagct caacgagacg 1440 
gagaagcggc tcatcaaagg cggcaaggcc agcaacatgg ccgtggtgta cggcaaggag 1500 
gcgagcatct ggaaggggaa ggagaagttc ctgggcatcc tgaacaaata catggagatc 1560 
catggcaccg tgtactacga gagccagcgg ccccccgagg tgccagcctt tgtgaagsiac 1620 
cacggcctct taccgcagcc tgagtttcag cagctgctgc gcaaggccaa actcttcatc 1680 
gggtttggct tcccctacga gggccccgcc cccctggagg ccatcgccaa tggttgcatc 1740 
ttcctgcagt cccgcttcag cccgccccac agctccctca accacgagtt cttccgaggc 1800 
aagcccacct ccagagaggt gttctcccag catccctacg cggagaactt catcggcaag 1860 
ccccacgtgt ggacagtcga ctacaacaac tcagaggagt ttgaagcagc catcaaggcc 1920 
attatgagaa ctcaggtaga cccctaccta ccctacgagt acacctgcga ggggatgctg 1980 
gagcggatcc acgcctacat ccagcaccag gacttctgca gagctccaga ccctgcccta 2040 
ccagaggccc acgccccgca gagccccttt gtcctggccc ccaatgccac ccacct:cgag 2100 
tgggctcgga acaccagctt ggctcctggg gcctggcccc ccgcgcacgc cctgcgggcc 2160 
tggctggccg tgcctgggag ggcctgcacc gacacctgcc tggaccacgg gctaatctgt 2220 
gagccctcct tcttcccctt cctgaacagc caggacgcct tcctcaagct gcaggtgccc 2280 
tgtgacagca ccgagtcgga gatgaaccac ctgtacccgg cgttcgccca gcctggccag 2340 
gagtgctacc tgcagaagga gcctctgctc ttcagctgcg ccggctccaa caccaagtac 2400 
cgccggctct gcccctgccg cgacttccgc aagggccagg tggccttgtg ccagggctgt 2460 
ctgtga 2466 



<210> 22 

<211> 1444 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 6970969CB1 



<400> 22 

aacagtgctg ccaagtttgg 
gcagccgccc agtccgacca 
gcttgcgcgc caggaaatcc 
catgcgccag aacatcacca 
tgtcaggggc aggtccagcg 
gctCcLacata ctgcccgaga 
cagcgtcacg atgatcgggc 
tccaaacagc atcaagcctt 
caccaacaat acgccggtgc 



cctaaggtag cgactggaac 
tcaggatcgc gaacgccgcc 
tggcggttgg cacattcgcg 
ccccggcgag gctggcgagc 
cagtggcggc gcgtcccagc 
gcaaggcgaa atatttggcc 
ccagggtata aaacacgccc 
ggccaaaccc tggcgtcagg 
ttgcaatggc ccgcgccgcg 



gaagacagcc ccccgaatac 60 
ttggaagaca ggacgcggcg 120 
tgctgtcgca accgagtgcc 180 
ccggcgatat agaacgcggt 240 
aaagggccgg aacctatgcc 300 
cgctggcgcg gttccaccag 360 
cagcccaaac caagagacaa 420 
gcaaacccca ggcaggccac 480 
cccagcgcgt cagccagatg 540 
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ctgactgagc agtacctcca gctggacttg 
ccgcgggtgg agttttggac cagcaccatc 
cacagtcctc tttcttcctt aactttcctc 
gctccttgag aagttatctt tctgtgattt 
cgcaatcatt agcttctttc tgagctcagc 
ctgctttcgc tgatatttta tggacttagg 
aaaaagtgca atttgtcttg gaaggcaaca 
ggcctattag ttgtcataat aatggttttg 
aataggtaaa aa 



aacatgagcc tcggacagag actcaaagaa 1800 
cccctgatcc tgtctgcctc cgacatgctc 1860 
tctctcctcc agcctttctt tttcttttgt 1920 
tggtttccct tctcctccca taatttctcc 1980 
tgctttctat ggggatcctt gcaaaacaag 2040 
aatgatcctt acagaattct tttcaacatc 2100 
agatttcttc aataaatttg gaagagggct 2160 
teiactcatat gaaataaaat cagsiatgtaa 2220 

2232 



<210> 24 

<211> 816 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc^feature 

<223> Incyte ID No: 55048919CB1 



<400> 24 

tggatagcgg ctttgacgat cctgggactc 
ttccgcgccg aaagccgtca agagcaacat 
ggtgaccttg accaacggtc gcagtggcgt 
gcctggctgg cacgccatgc atttccatgc 
gaaatcgggc gcgcatatca accacgagga 
ggaagggccg gactttgggg aattgccgaa 
tgccgaagcc ttctcggcac ttgtctcgct 
cgatgccgat gggtccgccc tggttattca 
gatcggcggg gcgggcgcac gcgttgcgtg 
agcacgggac gcgtgcggtc cgccgatcga 
aggtgcgcct tagccacgcc gtgttcggcg 
acgcgccgca aaagcggcga cccagaaaat 
aatttagaag gtaggataat atgatcataa 
cgcagatcaa ccagtaagtg gagaagatga 

<210> 25 

<211> 922 

<212> DNA 

<213> Homo sapieos 



gcaacggcaa ccgcggcgat tcgcgcaaac 60 
cgttcaggcc ggcggcggca agatcggaga 120 
gcttctgcgt ctgacggcgt ccgggctgac 180 
gactggcgat tgctccgacc aaggcttcca 240 
tcacaagaca cctcacgggc ttctcaatcc 300 
tattcatgtg gcggcggacg gcaccgtcaa -360 
ggacgcagcg tcatcccggc cgaaccttct 420 
cgccagcccc gatgaccatg tcacgcaacc 480 
tgccgttatc cggtaagcgc cagcataccc 540 
ccgcatggcg gatttttaac gaattcggcg 600 
cgaccaatgg atggagagaa taacgcgttt 660 
cagcaacttt ccggcccgca acttgagtga 720 
tcaacccgcg ctcagcttga tatcttccgg 780 
cgacat 816 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7485135CB1 

<400> 25 * 
atggcagaga agcccaagct ccactactcc 
tggctcctgg ctgcagccgg agtagagttt 
ttggaaaagt taagaaatga tgggagtttt 
gatgggatga agctggttca gaccagagcc 
ctctatcgga aagacataaa ggagagagtc 
gatctgggtg aaatgatcct tcctctgccc 
cttgccttga accaacagag aacaaaaaat 
aagagcaatg gacaagacta ccttgttggc 
gtggaacttc tctactacat ggaagagctt 
ctgaaggccc tgaaaaccag aatcagcaac 
ggcagcccaa ggaagtctct catggatgag 



aatgcacagg gcagaatgga gtccacctgg 60 
gaagagaaat ttctagaatc tgcagaagat 120 
atgttccaac aagtgccaat ggttgagatt 180 
attctcaact acgttgccag caaatacaat 240 
ctgattgata tgtacacaga aggtatagca 300 
atatgtcaac ctgaggataa agacaccaag 360 
cgctatttcc ctgcctttga aaaagtctta 420 
Eiacaagctga gctgggctga cattcacttg 480 
gactctagcc ttatcttcag cttccctctg 540 
ctgcccatgg tgaagaaatt. tctacagcct 600 
aaatctttag aagaagcaag gaagattttc 660 
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aggttttaat aacgcagtca 
caacaataaa gtactttacc 
tcaaattata tgctaattaa 
gttttcatta ggatctgatg 
ctggaattaa aaattcagta 



tggaggccaa gaacttgcaa 
taagtgttga ttgtgcctgt 
ataatacaac tcctattcgc 
tgaattcaga tttccaatct 
aa 



taccaatgtt ctaaagtttg 720 
tgtgaagcta atgaactctt 780 
tgacttagtt aaaattgatt 840 
tctcctagcc aaccattttc 900 

922 



<210> 26 

<211> 1084 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> mlsc_feature 

<223> Incyte ID No: 7684978CB1 



<400> 26 

atcggccgca tggtcgccag tggcctgggt 
cagatgcagg aattgggcgc ccgctgcgtg 
atcggcgtga ttgccctggc cgatcacaaa 
gtactgcttt cccataccca gactccggag 
gcagcgtgcc ggccattggc caaggcaccc 
ccgccgaagt ggccgccctg caatacggta 
ccgaaatgta tgccgaaggc ggcgccgaga 
gtgatcaggt gtttctggtt agcaaggtgt 
ctgaagcctg cgaacgcagc ctgaagcgcc 
tgcactggcg tggccagtat ccgctggagg 
agcagggcaa gattcgccac tggggggtat 
tgcacaaccc ggattgcgcc accaatcagg 
aattcgacct gttgccgtgg agccgccaac 
tggcccaggc tggccagctg ttgcagcacc 
gcgcaaggcc ggcgcaggtc agcctggcct 
ttcccaaggc ggtgtcatca gaacacgtgc 
tgacaagtga agacttgcgg gcgatcgacc 
atttggcgat ggtctgactc aagctcagtg 
cage 



gtgagcgcgg tgccggcgct gtgcatcaac 60 
acgctggtcg aacccgccgt cgaacggcgt 120 
ctttccaccg cagcccaggc gctgctcgag 180 
gtgacatgcg ttacgtgaaa ctggcaggtt 240 
ggtacatggg cgaagacccg gcgcgcaagg 300 
tcgaacaggg catgaccctg atcgacagtg 360 
aagtggtcgg ccaagccatt gccgggcgcc 420 
acccgcacaa cgccagccgc cggggtgtgc 480 
tgggtaccga ggtgatcgac ctttatctgc 540 
aaacggtcga ggcgttcgag cgcttgcgcg 600 
ccaacttcga tgtcgatgac ctgcgcgaac 660 
tgctgtacaa cccggcgcag cggggcattg 720 
ggggcttgcc gaccatggcc tactgcccgc 780 
cggtgctgag cgagattgcc gaacgccatg 840 
gggtgacacg ggacaatgat gtgattgcca 900 
ggctgaatgc ggctgccggg gcgttgaccc 960 
gggcgtttcc cgcaccaacg cgcaaacagc 1020 
aaagttccgg ctctttacat ccagcccctg 1080 

1084 
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